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Preface 



This book is primarily intended to accompany an undergraduate course 
in atomic physics. It covers the core material and a selection of more 
advanced topics that illustrate current research in this field. The first 
six chapters describe the basic principles of atomic structure, starting 
in Chapter 1 with a review of the classical ideas. Inevitably the dis- 
cussion of the structure of hydrogen and helium in these early chapters 
has considerable overlap with introductory quantum mechanics courses, 
but an understanding of these simple systems provides the basis for the 
treatment of more complex atoms in later chapters. Chapter 7 on the 
interaction of radiation with atoms marks the transition between the 
earlier chapters on structure and the second half of the book which cov- 
ers laser spectroscopy, laser cooling, Bose-Einstein condensation of di- 
lute atomic vapours, matter- wave interferometry and ion trapping. The 
exciting new developments in laser cooling and trapping of atoms and 
Bose-Einstein condensation led to Nobel prizes in 1997 and 2001, respec- 
tively. Some of the other selected topics show the incredible precision 
that has been achieved by measurements in atomic physics experiments. 
This theme is taken up in the final chapter that looks at quantum infor- 
mation processing from an atomic physics perspective; the techniques 
developed for precision measurements on atoms and ions give exquisite 
control over these quantum systems and enable elegant new ideas from 
quantum computation to be implemented. 

The book assumes a knowledge of quantum mechanics equivalent to an 
introductory university course, e.g. the solution of the Schrodinger equa- 
tion in three dimensions and perturbation theory. This initial knowledge 
will be reinforced by many examples in this book; topics generally re- 
garded as difficult at the undergraduate level are explained in some de- 
tail, e.g. degenerate perturbation theory. The hierarchical structure of 
atoms is well described by perturbation theory since the different layers 
of structure within atoms have considerably different energies associated 
with them, and this is reflected in the names of the gross, fine and hyper- 
fine structures. In the early chapters of this book, atomic physics may 
appear to be simply applied quantum mechanics, i.e. we write down the 
Hamiltonian for a given interaction and solve the Schrodinger equation 
with suitable approximations. I hope that the study of the more ad- 
vanced material in the later chapters will lead to a more mature and 
deeper understanding of atomic physics. Throughout this book the ex- 
perimental basis of atomic physics is emphasised and it is hoped that 
the reader will gain some factual knowledge of atomic spectra. 
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The selection of topics from the diversity of current atomic physics 
is necessarily subjective. I have concentrated on low-energy and high- 
precision experiments which, to some extent, reflects local research in- 
terests that are used as examples in undergraduate lectures at Oxford. 
One of the selection criteria was that the material is not readily avail- 
able in other textbooks, at the time of writing, e.g. atomic collisions 
have not been treated in detail (only a brief summary of the scattering 
of ultracold atoms is included in Chapter 10). Other notable omissions 
include: X-ray spectra, which are discussed only briefly in connection 
with the historically important work of Moseley, although they form an 
important frontier of current research; atoms in strong laser fields and 
plasmas; Rydberg atoms and atoms in doubly- and multiply-excited 
states (e.g. excited by new synchrotron and free-electron laser sources); 
and the structure and spectra of molecules. 

I would like to thank Geoffrey Brooker for invaluable advice on physics 
(in particular Appendix B) and on technical details of writing a textbook 
for the Oxford Master Series. Keith Burnett, Jonathan Jones and An- 
drew Steane have helped to clarify certain points, in my mind at least, 
and hopefully also in the text. The series of lectures on laser cooling 
given by William Phillips while he was a visiting professor in Oxford was 
extremely helpful in the writing of the chapter on that topic. The fol- 
lowing people provided very useful comments on the draft manuscript: 
Rachel Godun, David Lucas, Mark Lee, Matthew McDonnell, Martin 
Shotter, Claes-Goran Wahlstrom (Lund University) and the (anony- 
mous) reviewers. Without the encouragement of Sonke Adlung at OUP 
this project would not have been completed. Irmgard Smith drew some 
of the diagrams. I am very grateful for the diagrams and data supplied 
by colleagues, and reproduced with their permission, as acknowledged 
in the figure captions. Several of the exercises on atomic structure de- 
rive from Oxford University examination papers and it is not possible to 
identify the examiners individually — some of these exam questions may 
themselves have been adapted from some older sources of which I am 
not aware. 

Finally, I would like to thank Professors Derek Stacey, Joshua Silver 
and Patrick Sandars who taught me atomic physics as an undergraduate 
and graduate student in Oxford. I also owe a considerable debt to the 
book on elementary atomic structure by Gordon Kemble Woodgate, who 
was my predecessor as physics tutor at St Peter’s College, Oxford. In 
writing this new text, I have tried to achieve the same high standards 
of clarity and conciseness of expression whilst introducing new examples 
and techniques from the laser era. 

Background reading 

It is not surprising that our language should be incapable of 
describing the processes occurring with the atoms, for it was 
invented to describe the experiences of daily life, and these 
consist only of processes involving exceeding large numbers 



of atoms. Furthermore, it is very difficult to modify our 
language so that it will be able to describe these atomic pro- 
cesses, for words can only describe things of which we can 
form mental pictures, and this ability, too, in the result of 
daily experience. Fortunately, mathematics is not subject to 
this limitation, and it has been possible to invent a mathe- 
matical scheme — the quantum theory — which seems entirely 
adequate for the treatment of atomic processes. 

From The physical principles of the quantum theory , Werner 
Heisenberg (1930). 

The point of the excerpt is that quantum mechanics is essential for a 
proper description of atomic physics and there are many quantum me- 
chanics textbooks that would serve as useful background reading for this 
book. The following short list includes those that the author found par- 
ticularly relevant: Mandl (1992), Rae (1992) and Griffiths (1995). The 
book Atomic spectra by Softley (1994) provides a concise introduction to 
this field. The books Cohen- Tannoudji et al. (1977), Atkins (1983) and 
Basdevant and Dalibard (2000) are very useful for reference and contain 
many detailed examples of atomic physics. Angular- momentum theory 
is very important for dealing with complicated atomic structures, but 
it is beyond the intended level of this book. The classic book by Dirac 
(1981) still provides a very readable account of the addition of angular 
momenta in quantum mechanics. A more advanced treatment of atomic 
structure can be found in Condon and Odabasi (1980), Cowan (1981) 
and Sobelman (1996). 

Oxford C.J.F. 



Web site: 

http : //www. physics . ox. ac .uk/users/f oot 

This site has answers to some of the exercises, corrections and other 
supplementary information. 
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Early atomic physics 




1.1 Introduction 

The origins of atomic physics were entwined with the development of 
quantum mechanics itself ever since the first model of the hydrogen 
atom by Bohr. This introductory chapter surveys some of the early 
ideas, including Einstein’s treatment of the interaction of atoms with 
radiation, and a classical treatment of the Zeeman effect. These meth- 
ods, developed before the advent of the Schrodinger equation, remain 
useful as an intuitive way of thinking about atomic structure and tran- 
sitions between the energy levels. The ‘proper’ description in terms of 
atomic wavefunctions is presented in subsequent chapters. 

Before describing the theory of an atom with one electron, some ex- 
perimental facts are presented. This ordering of experiment followed 
by explanation reflects the author’s opinion that atomic physics should 
not be presented as applied quantum mechanics, but it should be mo- 
tivated by the desire to understand experiments. This represents what 
really happens in research where most advances come about through the 
interplay of theory and experiment. 
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1.2 Spectrum of atomic hydrogen 

It has long been known that the spectrum of light emitted by an element 
is characteristic of that element, e.g. sodium in a street lamp, or burn- 
ing in a flame, produces a distinctive yellow light. This crude form of 
spectroscopy, in which the colour is seen by eye, formed the basis for a 
simple chemical analysis. A more sophisticated approach using a prism, 
or diffraction grating, to disperse the light inside a spectrograph shows 
that the characteristic spectrum for atoms is composed of discrete lines 
that are the ‘fingerprint’ of the element. As early as the 1880s, Fraun- 
hofer used a spectrograph to measure the wavelength of lines, that had 
not been seen before, in light from the sun and he deduced the exis- 
tence of a new element called helium. In contrast to atoms, the spectra 
of molecules (even the simplest diatomic ones) contain many closely- 
spaced lines that form characteristic molecular bands; large molecules, 
and solids, usually have nearly continuous spectra with few sharp fea- 
tures. In 1888, the Swedish professor J. Rydberg found that the spectral 
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lr rhe Swiss mathematician Johann 
Balmer wrote down an expression 
which was a particular case of eqn 1.1 
with n = 2, a few years before Jo- 
hannes (commonly called Janne) Ry- 
dberg found the general formula that 
predicted other series, 
o 

A spectrum of the Balmer series of 
lines is on the cover of this book. 



q 

In this book transitions are also spec- 
ified in terms of their frequency (de- 
noted by / so that / = ci > ), or in elec- 
tron volts (eV) where appropriate. 



4 Air absorbs radiation at wavelengths 
shorter than about 200 nm and so 
spectrographs must be evacuated, as 
well as being made with special optics. 



lines in hydrogen obey the following mathematical formula: 




(i.i) 



where n and n' are whole numbers; R is a constant that has become 
known as the Rydberg constant. The series of spectral lines for which 
n = 2 and n' = 3,4,... is now called the Balmer series and lies in the 
visible region of the spectrum. 1 The first line at 656 nm is called the 
Balmer-<a (or H a ) line and it gives rise to the distinctive red colour of 
a hydrogen discharge — a healthy red glow indicates that most of the 
molecules of H 2 have been dissociated into atoms by being bombarded 
by electrons in the discharge. The next line in the series is the Balmer-/^ 
line at 486 nm in the blue and subsequent lines at shorter wavelengths 
tend to a limit in the violet region. 2 To describe such series of lines it is 
convenient to define the reciprocal of the transition wavelength as the 
wavenumber v that has units of m -1 (or often cm -1 ), 




Wavenumbers may seem rather old-fashioned but they are very useful 
in atomic physics since they are easily evaluated from measured wave- 
lengths without any conversion factor. In practice, the units used for 
a given quantity are related to the method used to measure it, e.g. 
spectroscopes and spectrographs are calibrated in terms of wavelength. 3 
A photon with wavenumber v has energy E = hci>. The Balmer for- 
mula implicitly contains a more general empirical law called the Ritz 
combination principle that states: the wavenumbers of certain lines in 
the spectrum can be expressed as sums (or differences) of other lines: 
z >3 = v\ e.g. the wavenumber of the Balmer-/^ line (n = 2 to n' = 4) 
is the sum of that for Balmer-n (n = 2 to n' = 3) and the first line in 
the Paschen series (n m 3 to ml = 4). Nowadays this seems obvious 
since we know about the underlying energy-level structure of atoms but 
it is still a useful principle for analyzing spectra. Examination of the 
sums and differences of the wavenumbers of transitions gives clues that 
enable the underlying structure to be deduced, rather like a crossword 
puzzle — some examples of this are given in later chapters. The observed 
spectral lines in hydrogen can all be expressed as differences between 
energy levels, as shown in Fig. 1.1, where the energies are proportional 
to 1/n 2 . Other series predicted by eqn 1.1 were more difficult to observe 
experimentally than the Balmer series. The transitions to n = 1 give 
the Lyman series in the vacuum ultraviolet region of the spectrum. 4 The 
series of lines with wavelengths longer than the Balmer series he in the 
infra-red region (not visible to the human eye, nor readily detected by 
photographic film — the main methods available to the early spectroscop- 
ists). The following section looks at how these spectra can be explained 
theoretically. 
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Fig. 1.1 The energy levels of the hydro- 
gen atom. The transitions from higher 
shells n' = 2, 3, 4, . . . down to the n = 1 
shell give the Lyman series of spectral 
lines. The series of lines formed by 
transitions to other shells are: Balmer 
(n = 2), Paschen (n = 3), Brack- 
ett ( n = 4) and Pfund ( n = 5) (the 
last two are not labelled in the figure). 
Within each series the lines are denoted 
by Greek letters, e.g. L a for n = 2 to 
n — 1 and for n — 4 to n = 2. 



1.3 Bohr’s theory 

In 1913, Bohr put forward a radical new model of the hydrogen atom 
using quantum mechanics. It was known from Rutherford’s experiments 
that inside atoms there is a very small, dense nucleus with a positive 
charge. In the case of hydrogen this is a single proton with a single elec- 
tron bound to it by the Coulomb force. Since the force is proportional 
to 1/r 2 , as for gravity, the atom can be considered in classical terms as 
resembling a miniature solar system with the electron orbiting around 
the proton, just like a planet going around the sun. However, quantum 
mechanics is important in small systems and only certain electron orbits 
are allowed. This can be deduced from the observation that hydrogen 
atoms emit light only at particular wavelengths corresponding to tran- 
sitions between discrete energies. Bohr was able to explain the observed 
spectrum by introducing the then novel idea of quantisation that goes 
beyond any previous classical theory. He took the orbits that occur in 
classical mechanics and imposed quantisation rules onto them. 

Bohr assumed that each electron orbits the nucleus in a circle, whose 
radius r is determined by the balance between centripetal acceleration 
and the Coulomb attraction towards the proton. For electrons of mass 
m e and speed v this gives 

9 9 

m e v e 

r 47reor 2 

In SI units the strength of the electrostatic interaction between two 



(1.3) 
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5 

Older systems of units give more suc- 
cinct equations without 47reo; some of 
this neatness can be retained by keep- 
ing e 2 /47reo grouped together. 



charges of magnitude e is characterised by the combination of constants 
e 2 /47reo- 5 This leads to the following relation between the angular fre- 
quency (j = v/r and the radius: 



u 



2 



e 2 / 47 T 60 
m e r 3 



(1.4) 



This is equivalent to Kepler’s laws for planetary orbits relating the square 
of the period 2tt/uj to the cube of the radius (as expected since all steps 
have been purely classical mechanics). The total energy of an electron 
in such an orbit is the sum of its kinetic and potential energies: 



E = -m e v 2 



e 2 / 4tt€q 



(1.5) 



Using eqn 1.3 we find that the kinetic energy has a magnitude equal 
to half the potential energy (an example of the virial theorem). Taking 
into account the opposite signs of kinetic and potential energy, we find 



Em- 



e 2 / 4tt€o 
2 r 



( 1 . 6 ) 



This total energy is negative because the electron is bound to the proton 
and energy must be supplied to remove it. To go further Bohr made the 
following assumption. 



Assumption I There are certain allowed orbits for which the electron 
has a fixed energy. The electron loses energy only when it jumps between 
the allowed orbits and the atom emits this energy as light of a given 
wavelength. 



That electrons in the allowed orbits do not radiate energy is contrary 
to classical electrodynamics — a charged particle in circular motion un- 
dergoes acceleration and hence radiates electromagnetic waves. Bohr’s 
model does not explain why the electron does not radiate but simply 
takes this as an assumption that turns out to agree with the experi- 
mental data. We now need to determine which out of all the possible 
classical orbits are the allowed ones. There are various ways of doing this 
and we follow the standard method, used in many elementary texts, that 
assumes quantisation of the angular momentum in integral multiples of 
h (Planck’s constant over 2i r): 

m e vr = nh , (1.7) 



where n is an integer. Combining this with eqn 1.3 gives the radii of the 
allowed orbits as 

r = aon 2 , (1.8) 

where the Bohr radius ao is given by 

h 2 

a ° = TlT / i \ • ( L9 ) 

(e747re 0 ) m e 

This is the natural unit of length in atomic physics. Equations 1.6 and 
1.8 combine to give the famous Bohr formula: 

E = e 2 /47re 0 1 

2ao n 2 ' 



( 1 . 10 ) 
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The positive integer n is called the principal quantum number. 6 

Bohr’s formula predicts that in the transitions between these energy 
levels the atoms emit light with a wavenumber given by 

" = (L11) 



This equation fits very closely to the observed spectrum of atomic hy- 
drogen described by eqn 1.1. The Rydberg constant Rqo in eqn 1.11 is 
defined by 



hcRoo — 



2 /47re 0 ) / 



m e 



2h 2 



( 1 . 12 ) 



(i The alert reader may wonder why 
this is true since we introduced n in 
connection with angular momentum in 
eqn 1.7, and (as shown later) elec- 
trons can have zero angular momen- 
tum. This arises from the simplifica- 
tion of Bohr’s theory. Exercise 1.12 dis- 
cusses a more satisfactory, but longer 
and subtler, derivation that is closer to 
Bohr’s original papers. However, the 
important thing to remember from this 
introduction is not the formalism but 
the magnitude of the atomic energies 
and sizes. 



The factor of he multiplying the Rydberg constant is the conversion fac- 
tor between energy and wavenumbers since the value of Roo is given 
in units of m -1 (or cm -1 in commonly-used units). The measure- 
ment of the spectrum of atomic hydrogen using laser techniques has 
given an extremely accurate value for the Rydberg constant 7 R 0 0 = 
10 973 731.568 525 m -1 . However, there is a subtle difference between 
the Rydberg constant calculated for an electron orbiting a fixed nucleus 
R 0 o and the constant for real hydrogen atoms in eqn 1.1 (we originally 
wrote R without a subscript but more strictly we should specify that 
it is the constant for hydrogen Rh)- The theoretical treatment above 
has assumed an infinitely massive nucleus, hence the subscript oc. In 
reality both the electron and proton move around the centre of mass of 
the system. For a nucleus of finite mass M the equations are modified 
by replacing the electron mass m e by its reduced mass 



7 This is the 2002 CODATA recom- 
mended value. The currently accepted 
values of physical constants can be 
found on the web site of the National 
Institute of Science and Technology 
(NIST). 



m e M 
m e + M 



(1.13) 



For hydrogen 



Rh=Roo-^ R 



M n 



— Ro 




(1.14) 



where the electron-to-proton mass ratio is m e /M p ~ 1/1836. This 
reduced-mass correction is not the same for different isotopes of an el- 
ement, e.g. hydrogen and deuterium. This leads to a small but readily 
observable difference in the frequency of the light emitted by the atoms 
of different isotopes; this is called the isotope shift (see Exercises 1.1 and 
1 . 2 ). 



1.4 Relativistic effects 

Bohr’s theory was a great breakthrough. It was such a radical change 
that the fundamental idea about the quantisation of the orbits was at 
first difficult for people to appreciate — they worried about how the elec- 
trons could know which orbits they were going into before they jumped. 
It was soon realised, however, that the assumption of circular orbits is 
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o 

This has a simple interpretation in 
terms of the de Broglie wavelength 
associated with an electron A^b = 
h/m e v. The allowed orbits are those 
that have an integer multiple of de 
Broglie wavelengths around the circum- 
ference: 27r r = he. they are 

standing matter waves. Curiously, this 
idea has some resonance with modern 
ideas in string theory. 



too much of an over-simplification. Sommerfeld produced a quantum 
mechanical theory of electrons in elliptical orbits that was consistent 
with special relativity. He introduced quantisation through a general 
rule that stated ‘the integral of the momentum associated with a coor- 
dinate around one period of the motion associated with that coordinate 
is an integral multiple of Planck’s constant’. This general method can 
be applied to any physical system where the classical motion is periodic. 
Applying this quantisation rule to momentum around a circular orbit 
gives the equivalent of eqn 1.7: 8 

m e v x 2irr = nh . (1.15) 

In addition to quantising the motion in the coordinate 6 , Sommerfeld 
also considered quantisation of the radial degree of freedom r. He found 
that some of the elliptical orbits expected for a potential proportional 
to 1/r are also stationary states (some of the allowed orbits have a high 
eccentricity, more like those of comets than planets). Much effort was 
put into complicated schemes based on classical orbits with quantisation, 
and by incorporating special relativity this ‘old quantum theory’ could 
explain accurately the fine structure of spectral lines. The exact details 
of this work are now mainly of historical interest but it is worthwhile 
to make a simple estimate of relativistic effects. In special relativity a 
particle of rest mass m moving at speed v has an energy 



E ( v ) = 7 me 2 , 



(1.16) 



9 We neglect a factor of I in the bino- 
mial expansion of the expression for 7 
at low speeds, v 2 /c 2 <C 1 . 



where the gamma factor is 7 = 1 / — v 2 /c 2 . The kinetic energy of the 
moving particle is A E = E (v) — E(0 ) = (7 — 1) m e c 2 . Thus relativistic 
effects produce a fractional change in energy: 9 



A E ~ v 2 

~E~~&' 



(1.17) 



This leads to energy differences between the various elliptical orbits of 
the same gross energy because the speed varies in different ways around 
the elliptical orbits, e.g. for a circular orbit and a highly elliptical orbit 
of the same gross energy. From eqns 1.3 and 1.7 we find that the ratio 
of the speed in the orbit to the speed of light is 



v a 

c n ’ 



(1.18) 



where the fine-structure constant a is given by 



a = 



e 2 /47reo 

he 



(1.19) 



19 An electron in the Bohr orbit with 
n = 1 has speed ac. Hence it has linear 
momentum m e ac and angular momen- 
tum m e acao = h. 



This fundamental constant plays an important role throughout atomic 
physics. 10 Numerically its value is approximately a ~ 1/137 (see inside 
the back cover for a list of constants used in atomic physics). From 
eqn 1.17 we see that relativistic effects lead to energy differences of 
order a 2 times the gross energy. (This crude estimate neglects some 



1.5 



dependence on principal quantum number and Chapter 2 gives a more 
quantitative treatment of this fine structure.) It is not necessary to go 
into all the refinements of Sommerfeld’s relativistic theory that gave 
the energy levels in hydrogen very precisely, by imposing quantisation 
rules on classical orbits, since ultimately a paradigm shift was neces- 
sary. Those ideas were superseded by the use of wavefunctions in the 
Schrodinger equation. The idea of elliptical orbits provides a connection 
with our intuition based on classical mechanics and we often retain some 
traces of this simple picture of electron orbits in our minds. However, 
for atoms with more than one electron, e.g. helium, classical models do 
not work and we must think in terms of wavefunctions. 



1.5 Moseley and the atomic number 

At the same time as Bohr was working on his model of the hydrogen 
atom, H. G. J. Moseley measured the X-ray spectra of many elements. 
Moseley established that the square root of the frequency of the emitted 
lines is proportional to the atomic number Z (that he defined as the 
position of the atom in the periodic table, starting counting at Z = 1 
for hydrogen), i.e. 

y/f<xZ. (1.20) 

Moseley’s original plot is shown in Fig. 1.2. As we shall see, this equation 
is a considerable simplification of the actual situation but it was remark- 
ably powerful at the time. By ordering the elements using Z rather than 
relative atomic mass, as was done previously, several inconsistencies in 
the periodic table were resolved. There were still gaps that were later 
filled by the discovery of new elements. In particular, for the rare-earth 
elements that have similar chemical properties and are therefore difficult 
to distinguish, it was said ‘in an afternoon, Moseley could solve the prob- 
lem that had baffled chemists for many decades and establish the true 
number of possible rare earths’ (Segre 1980). Moseley’s observations can 
be explained by a relatively simple model for atoms that extends Bohr’s 
model for hydrogen. 11 

A natural way to extend Bohr’s atomic model to heavier atoms is 
to suppose that the electrons fill up the allowed orbits starting from 
the bottom. Each energy level only has room for a certain number of 
electrons so they cannot all go into the lowest level and they arrange 
themselves in shells, labelled by the principal quantum number, around 
the nucleus. This shell structure arises because of the Pauli exclusion 
principle and the electron spin, but for now let us simply consider it as an 
empirical fact that the maximum number of electrons in the n = 1 shell 
is 2, the n = 2 shell has 8 and the n = 3 shell has 18, etc. For historical 
reasons, X-ray spectroscopists do not use the principal quantum number 
but label the shells by letters: K for n = 1, L for n = 2, M for n = 3 
and so on alphabetically. 12 This concept of electronic shells explains the 
emission of X-rays from atoms in the following way. Moseley produced 
X-rays by bombarding samples of the given element with electrons that 



Moseley and the atomic number 7 



11 Tragically, Henry Gwyn Jeffreys 
Moseley was killed when he was only 
28 while fighting in the First World War 
(see the biography by Heilbron (1974)). 



12 

The chemical properties of the ele- 
ments depend on this electronic struc- 
ture, e.g. the inert gases have full shells 
of electrons and these stable configura- 
tions are not willing to form chemical 
bonds. The explanation of the atomic 
structure underlying the periodic ta- 
ble is discussed further in Section 4.1. 
See also Atkins (1994) and Grant and 
Phillips (2001). 
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Wavelength x 10 s cm 




13 

The handwriting in the bottom right 
corner states that this diagram is the 
original for Moseley’s famous paper in 
Phil Mag., 27, 703 (1914). 



Fig. 1.2 Moseley’s plot of the square root of the frequency of X-ray lines of elements 
against their atomic number. Moseley’s work established the atomic number Z as 
a more fundamental quantity than the ‘atomic weight’ (now called relative atomic 
mass). Following modern convention the units of the horizontal scales would be 
(10 8 \/Hz) at the bottom and (10 -10 m) for the log scale at the top. (Archives of the 
Clarendon Laboratory, Oxford; also shown on the Oxford physics web site.) 13 



1.5 Moseley and the atomic number 9 



had been accelerated to a high voltage in a vacuum tube. These fast 
electrons knock an electron out of an atom in the sample leaving a 
vacancy or hole in one of its shells. This allows an electron from a 
higher- lying shell to Tall down’ to fill this hole emitting radiation of a 
wavelength corresponding to the difference in energy between the shells. 

To explain Moseley’s observations quantitatively we need to modify 
the equations in Section 1.3, on Bohr’s theory, to account for the effect 
of a nucleus of charge greater than the +le of the proton. For a nuclear 
charge Ze we replace e 2 /47reo by Ze 2 / 47reo in all the equations, resulting 
in a formula for the energies like that of Balmer but multiplied by a factor 
of Z 2 . This dependence on the square of the atomic number means that, 
for all but the lightest elements, transitions between low-lying shells lead 
to emission of radiation in the X-ray region of the spectrum. Scaling the 
Bohr theory result is accurate for hydrogenic ions, i.e. systems with 
one electron around a nucleus of charge Ze. In neutral atoms the other 
electrons (that do not jump) are not simply passive spectators but partly 
screen the nuclear charge; for a given X-ray line, say the K- to L-shell 
transition, a more accurate formula is 



The screening factors <tk and <tl are not entirely independent of Z and 
the values of these screening factors for each shell vary slightly (see the 
exercises at the end of this chapter). For large atomic numbers this 
formula tends to eqn 1.20 (see Exercise 1.4). This simple approach does 
not explain why the screening factor for a shell can exceed the number 
of electrons inside that shell, e.g. <tk = 2 for Z = 74 although only 
one electron remains in this shell when a hole is formed. This does not 
make sense in a classical model with electrons orbiting around a nucleus, 
but can be explained by atomic wavefunctions — an electron with a high 
principal quantum number (and little angular momentum) has a finite 
probability of being found at small radial distances. 

The study of X-rays has developed into a whole field of its own within 
atomic physics, astrophysics and condensed matter, but there is only 
room to mention a few brief facts here. When an electron is removed 
from the K-shell the atom has an amount of energy equal to its bind- 
ing energy, i.e. a positive amount of energy, and it is therefore usual 
to draw the diagram with the K-shell at the top, as in Fig. 1.3. These 
are the energy levels of the hole in the electron shells. This diagram 
shows why the creation of a hole in a low-lying shell leads to a succes- 
sion of transitions as the hole works its way outwards through the shells. 
The hole (or equivalently the falling electron) can jump more than one 
shell at a time; each line in a series from a given shell is labelled using 
Greek letters (as in the series in hydrogen), e.g. K a , K/ 3 , . . .. The levels 
drawn in Fig. 1.3 have some sub-structure and this leads to transitions 
with slightly different wavelengths, as shown in Moseley’s plot. This is 
fine structure caused by relativistic effects that we considered for Som- 
merfeld’s theory; the substitution e 2 /47reo — > Ze 2 /47reo, as above, (or 




( 1 . 21 ) 
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Fig. 1.3 The energy levels of the inner 
shells of the tungsten atom (Z = 74) 
and the transitions between them that 
give rise to X-rays. The level scheme 
has several important differences from 
that for the hydrogen atom (Fig. 1.1). 
Firstly, the energies are tens of keV, 
as compared to eV for Z = 1, be- 
cause they scale as Z 2 (approximately). 
Secondly, the energy levels are plotted 
with n — 1 at the top because when 
an electron is removed from the K-shell 
the system has more energy than the 
neutral atom; energies are shown for 
an atom with a vacancy (missing elec- 
tron) in the K-, L-, M- and N-shells. 
The atom emits X-ray radiation when 
an electron drops down from a higher 
shell to fill a vacancy in a lower shell — 
this process is equivalent to the va- 
cancy, or hole, working its way out- 
wards. This way of plotting the ener- 
gies of the system shows clearly that 
the removal of an electron from the K- 
shell leads to a cascade of X-ray tran- 
sitions, e.g. a transition between the 
n = 1 and 2 shells gives a line in the 
K-series which is followed by a line in 
another series (L-, M-, etc.). When the 
vacancy reaches the outermost shells of 
electrons that are only partially filled 
with valence electrons with binding en- 
ergies of a few eV (the O- and P-shells 
in the case of tungsten), the transition 
energies become negligible compared to 
those between the inner shells. This 
level scheme is typical for electrons in a 
moderately heavy atom, i.e. one with 
filled K-, L-, M- and N-shells. (The 
lines of the L-series shown dotted are 
allowed X-ray transitions, but they do 
not occur following K« emission.) 



E (keV) 




equivalently a — > Za) shows that fine structure is of order (Za) 2 times 
the gross structure, which itself is proportional to Z 2 . Thus relativistic 
effects grow as Z 4 and become very significant for the inner electrons of 
heavy atoms, leading to the fine structure of the L- and M-shells seen in 
Fig. 1.3. This relativistic splitting of the shells explains why in Mose- 
ley’s plot (Fig. 1.2) there are two closely-spaced curves for the K a -line, 
and several curves for the L-series. 

Nowadays much of the X-ray work in atomic physics is carried out 
using sources such as synchrotrons; these devices accelerate electrons by 
the techniques used in particle accelerators. A beam of high-energy elec- 
trons circulates in a ring and the circular motion causes the electrons to 
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radiate X-rays. Such a source can be used to obtain an X-ray absorption 
spectrum. 14 There are many other applications of X-ray emission, e.g. 
as a diagnostic tool for the processes that occur in plasmas in fusion 
research and in astrophysical objects. Many interesting processes occur 
at ‘high energies’ in atomic physics but the emphasis in this book is 
mainly on lower energies. 



14 Absorption is easier to interpret than 
emission since only one of the terms 
in eqn 1.21 is important, e.g. E k = 
hcRoo (Z - cr K ) 2 . 



1.6 Radiative decay 



An electric dipole moment 
diates a power 15 



-exo oscillating at angular frequency u ra- 



P = 



e 2 XgC d 4 
127T60C 3 



( 1 . 22 ) 



An electron in harmonic motion has a total energy 16 of E = m e (j 2 XQ/2, 
where x$ is the amplitude of the motion. This energy decreases at a rate 
equal to the power radiated: 



d E _ e 2 uj 2 _ _E 

d t 67reom e c 3 r ’ 

where the classical radiative lifetime r is given by 



r 67T6o m e c 3 



(1.23) 



(1.24) 



For the transition in sodium at a wavelength of 589 nm (yellow light) 
this equation predicts a value of r = 16 ns ^ 10 -8 s. This is very close 
to the experimentally measured value and typical of allowed transitions 
that emit visible light. Atomic lifetimes, however, vary over a very wide 
range, 17 e.g. for the Lyman-<a transition (shown in Fig. 1.1) the upper 
level has a lifetime of only a few nanoseconds. 18,19 

The classical value of the lifetime gives the fastest time in which the 
atom could decay on a given transition and this is often close to the 
observed lifetime for strong transitions. Atoms do not decay faster than 
a classical dipole radiating at the same wavelength, but they may decay 
more slowly (by many orders of magnitude in the case of forbidden 
transitions). 20 



1.7 Einstein A and B coefficients 

The development of the ideas of atomic structure was linked to exper- 
iments on the emission, and absorption, of radiation from atoms, e.g. 
X-rays or light. The emission of radiation was considered as something 
that just has to happen in order to carry away the energy when an elec- 
tron jumps from one allowed orbit to another, but the mechanism was 
not explained. 21 In one of his many strokes of genius Einstein devised a 
way of treating the phenomenon of spontaneous emission quantitatively, 



15 

This total power equals the integral 
of the Poynting vector over a closed sur- 
face in the far- field of radiation from the 
dipole. This is calculated from the os- 
cillating electric and magnetic fields in 
this region (see electromagnetism texts 
or Corney (2000)). 

16 The sum of the kinetic and potential 
energies. 

17 The classical lifetime scales as 1/u 2 . 
However, we will find that the quantum 
mechanical result is different (see Exer- 
cise 1.8). 

18 

Higher-lying levels, e.g. n = 30, 
live for many microseconds (Gallagher 
1994). 

19 Atoms can be excited up to config- 
urations with high principal quantum 
numbers in laser experiments; such sys- 
tems are called Rydberg atoms and 
have small intervals between their en- 
ergy levels. As expected from the cor- 
respondence principle, these Rydberg 
atoms can be used in experiments that 
probe the interface between classical 
and quantum mechanics. 

20 

The ion-trapping techniques de- 
scribed in Chapter 12 can probe tran- 
sitions with spontaneous decay rates 
less than Is -1 , using single ions con- 
fined by electric and magnetic fields — 
something that was only a ‘thought 
experiment’ for Bohr and the other 
founders of quantum theory. In par- 
ticular, the effect of individual quan- 
tum jumps between atomic energy lev- 
els is observed. Radiative decay resem- 
bles radioactive decay in that individ- 
ual atoms spontaneously emit a photon 
at a given time but taking the average 
over an ensemble of atoms gives expo- 
nential decay. 

21 

A complete explanation of sponta- 
neous emission requires quantum elec- 
trodynamics. 
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22 

This treatment of the interaction of 
atoms with radiation forms the founda- 
tion for the theory of the laser, and is 
used whenever radiation interacts with 
matter (see Fox 2001). A historical ac- 
count of Einstein’s work and its pro- 
found implications can be found in Pais 
(1982). 



23 

The frequency dependence of the in- 
teraction is considered in Chapter 7. 



24 

The word laser is an acronym for light 
amplification by stimulated emission of 
radiation. 



25 This lifetime was estimated by a clas- 
sical argument in the previous section. 



Fig. 1.4 The interaction of a two-level 
atom with radiation leads to stimulated 
transitions, in addition to the sponta- 
neous decay of the upper level. 



based on an intuitive understanding of the process. 22 

Einstein considered atoms with two levels of energies, E\ and E 2 , as 
shown in Fig. 1 . 4 ; each level may have more than one state and the 
number of states with the same energy is the degeneracy of that level 
represented by g\ and g 2 . Einstein considered what happens to an atom 
interacting with radiation of energy density p( cj) per unit frequency in- 
terval. The radiation causes transitions from the lower to the upper level 
at a rate proportional to p(co 12), where the constant of proportionality 
is B 12. The atom interacts strongly only with that part of the distri- 
bution p(uj) with a frequency close to CJ12 = (E 2 — E\) /h, the atom’s 
resonant frequency. 23 By symmetry it is also expected that the radiation 
will cause transitions from the upper to lower levels at a rate dependent 
on the energy density but with a constant of proportionality B21 (the 
subscripts are in a different order for emission as compared to absorp- 
tion). This is a process of stimulated emission in which the radiation 
at angular frequency lo causes the atom to emit radiation of the same 
frequency. This increase in the amount of light at the incident frequency 
is fundamental to the operation of lasers. 24 The symmetry between up 
and down is broken by the process of spontaneous emission in which an 
atom falls down to the lower level, even when no external radiation is 
present. Einstein introduced the coefficient A21 to represent the rate of 
this process. Thus the rate equations for the populations of the levels, 
N\ and JV2, are 



dN 2 
d t 



— NiBi2p(u\2) — N2B2lp{pJ\2) — N2A21 



( 1 . 25 ) 



and 



dNi _ dN 2 
dt dt 



( 1 . 26 ) 



The first equation gives the rate of change of N2 in terms of the absorp- 
tion, stimulated emission and spontaneous emission, respectively. The 
second equation is a consequence of having only two levels so that atoms 
leaving level 2 must go into level 1; this is equivalent to a condition that 
N\ + N2 = constant. When p(cj) = 0 , and some atoms are initially in 
the upper level ( 7 V 2 ( 0 ) 7^ 0 ), the equations have a decaying exponential 
solution: 

N 2 {t) = N 2 { 0) exp (~A 21 t ) , ( 1 . 27 ) 

where the mean lifetime 25 is 



1 

r 



= A21 . 



( 1 . 28 ) 



i 

B 12 


i 

B 2 1 


A21 






y 


r y 


r 



E2\9l\ N'2 



E\\g\\ Ni 



1.8 The Zeeman effect 13 



Einstein devised a clever argument to find the relationship between the 
A 21 - and ^-coefficients and this allows a complete treatment of atoms in- 
teracting with radiation. Einstein imagined what would happen to such 
an atom in a region of black-body radiation, e.g. inside a box whose sur- 
face acts as a black body. The energy density of the radiation p{uj) du; 
between angular frequency u and uj + du; depends only on the tempera- 
ture T of the emitting (and absorbing) surfaces of the box; this function 
is given by the Planck distribution law: 26 

/ x Acj 3 1 

= 7r 2 c 3 exp(^/fc B T) - 1 ' (L29) 



Now we consider the level populations of an atom in this black-body 
radiation. At equilibrium the rates of change of N\ and N 2 (in eqn 1.26) 
are both zero and from eqn 1.25 we find that 



p(w 12 ) 



A21 1 

B21 (N 1 /N 2 )(B 1 2/B 2 i)-l m 



(1.30) 



At thermal equilibrium the population in each of the states within the 
levels are given by the Boltzmann factor (the population in each state 
equals that of the energy level divided by its degeneracy): 



N 2 

92 



Ah 

— exp 
9i 




(1.31) 



Combining the last three equations (1.29, 1.30 and 1.31) we find 27 



and 



A21 = 



B\2 = 



hw 3 

7 r 2 c 3 B 21 


(1.32) 
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— T>21 • 
91 


(1.33) 



The Einstein coefficients are properties of the atom. 28 Therefore these 
relationships between them hold for any type of radiation, from narrow- 
bandwidth radiation from a laser to broadband light. Importantly, 
eqn 1.32 shows that strong absorption is associated with strong emission. 
Like many of the topics covered in this chapter, Einstein’s treatment cap- 
tured the essential features of the physics long before all the details of 
the quantum mechanics were fully understood. 29 



1.8 The Zeeman effect 

This introductory survey of early atomic physics must include Zeeman’s 
important work on the effect of a magnetic field on atoms. The obser- 
vation of what we now call the Zeeman effect and three other crucial 
experiments were carried out just at the end of the nineteenth century, 
and together these discoveries mark the watershed between classical and 
quantum physics. 30 Before describing Zeeman’s work in detail, I shall 



26 

Planck was the first to consider radi- 
ation quantised into photons of energy 
hjj. See Pais (1986). 



27 

These equations hold for all T, so 
we can equate the parts that contain 
exp(fkc; / k^T) and the temperature- 
independent factors separately to ob- 
tain the two equations. 

28 

This is shown explicitly in Chapter 7 
by a time-dependent perturbation the- 
ory calculation of B\ 2 - 

29 

To excite a significant fraction of the 
population into the upper level of a visi- 
ble transition would require black-body 
radiation with a temperature compara- 
ble to that of the sun, and this method 
is not generally used in practice — such 
transitions are easily excited in an elec- 
trical discharge where the electrons im- 
part energy to the outermost electrons 
in an atom. (The voltage required to 
excite weakly-bound outer electrons is 
much less than for X-ray production.) 



30 Pais (1986) and Segre (1980) give his- 
torical accounts. 
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o I 

This led to the measurement of the 
atomic X-ray spectra by Moseley de- 
scribed in Section 1.5. 

on 

The field of nuclear physics was later 
developed by Rutherford, and others, 
to show that atoms have a very small 
dense nucleus that contains almost all 
the atomic mass. For much of atomic 
physics it is sufficient to think of the 
nucleus as a positive charge +Ze at the 
centre of the atoms. However, some un- 
derstanding of the size, shape and mag- 
netic moments of nuclei is necessary to 
explain the hyperfine structure and iso- 
tope shift (see Chapter 6). 



oo 

This is the same force that Thomson 
used to deflect free electrons in a curved 
trajectory to measure e/m e . Nowadays 
such cathode ray tubes are commonly 
used in classroom demonstrations. 



briefly mention the other three great breakthroughs and their signifi- 
cance for atomic physics. Rontgen discovered mysterious X-rays emit- 
ted from discharges, and sparks, that could pass through matter and 
blacken photographic film. 31 At about the same time, Bequerel’s dis- 
covery of radioactivity opened up the whole field of nuclear physics. 32 
Another great breakthrough was J. J. Thomson’s demonstration that 
cathode rays in electrical discharge tubes are charged particles whose 
charge-to-mass ratio does not depend on the gas in the discharge tube. 
At almost the same time, the observation of the Zeeman effect of a mag- 
netic field showed that there are particles with the same charge-to-mass 
ratio in atoms (that we now call electrons). The idea that atoms con- 
tain electrons is very obvious now but at that time it was a crucial piece 
in the jigsaw of atomic structure that Bohr put together in his model. 
In addition to its historical significance, the Zeeman effect provides a 
very useful tool for examining the structure of atoms, as we shall see 
at several places in this book. Somewhat surprisingly, it is possible to 
explain this effect by a classical-mechanics line of reasoning (in certain 
special cases). An atom in a magnetic field can be modelled as a simple 
harmonic oscillator. The restoring force on the electron is the same for 
displacements in all directions and the oscillator has the same resonant 
frequency cjo for motion along the x-, y- and z-directions (when there is 
no magnetic field). In a magnetic field B the equation of motion for an 
electron with charge — e, position r and velocity v = r is 

m e ^jj = —m e ujQ r — ev x B . (1-34) 

In addition to the restoring force (assumed to exist without further ex- 
planation), there is the Lorentz force that occurs for a charged particle 
moving through a magnetic field. 33 Taking the direction of the field to 
be the z- axis, B = Be z leads to 

r + 2^Lr x e z + = 0 . (1.35) 



This contains the Larmor frequency 



= 



eB 
2m e 



(1.36) 



We use a matrix method to solve the equation and look for a solution 
in the form of a vector oscillating at uj: 



r = Re 



y | exp (-i wt) 



(1.37) 



Written in matrix form, eqn 1.35 reads 




— 2ia;fA 




(1.38) 
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The eigenvalues j 2 are found from the following determinant: 



Uq — j 2 — 2kjf^L 0 

2ic<;f)L Uq — u 2 0 

0 0 Jq — j 2 



= 0. 



(1.39) 



This gives {a; 4 — (2jfi + u 2 +^o} (^ — ^o) = 0- The solution 
a; = a;o is obvious by inspection. The other two eigenvalues can be found 
exactly by solving the quadratic equation for j 2 inside the curly brackets. 
For an optical transition we always have Qj, <C Jo so the approximate 
eigenfrequencies are j ~ j 0 ± TLl- Substituting these values back into 
eqn 1.38 gives the eigenvectors corresponding to j = Jo — Jo and 
Jo + respectively, as 

/ COS ((Jo f2p) t \ / 0 \ 

r = — sin (jo ~ )t J , I 0 J 

y 0 J \ cosc^o t J 

( cos (cj 0 + fl L ) t \ 

sin (cjo + fi L ) t 

o J 



The magnetic field does not affect motion along the z-axis and the angu- 
lar frequency of the oscillation remains uq. Interaction with the magnetic 
field causes the motions in the x- and //-directions to be coupled together 
(by the off-diagonal elements ±2io;f2L of the matrix in eqn 1.38). 34 The 
result is two circular motions in opposite directions in the xy- plane, as 
illustrated in Fig. 1.5. These circular motions have frequencies shifted 
up, or down, from (Jo by the Larmor frequency. Thus the action of the 
external field splits the original oscillation at a single frequency (actu- 
ally three independent oscillations all with the same frequency, cjo) into 
three separate frequencies. An oscillating electron acts as a classical 
dipole that radiates electromagnetic waves and Zeeman observed the 
frequency splitting in the light emitted by the atom. 

This classical model of the Zeeman effect explains the polarization 
of the light, as well as the splitting of the lines into three components. 
The calculation of the polarization of the radiation at each of the three 
different frequencies for a general direction of observation is straight- 
forward using vectors; 35 however, only the particular cases where the 
radiation propagates parallel and perpendicular to the magnetic field 
are considered here, i.e. the longitudinal and transverse directions of 
observation, respectively. An electron oscillating parallel to B radiates 
an electromagnetic wave with linear polarization and angular frequency 
c^o- This 7r-component of the line is observed in all directions except 
along the magnetic field; 36 in the special case of transverse observation 
(i.e. in the xy- plane) the polarization of the 7r-component lies along 
e z . The circular motion of the oscillating electron in the xy - plane at 
angular frequencies Jo + and Jo — f^L produces radiation at these 
frequencies. Looking transversely, this circular motion is seen edge-on 
so that it looks like linear sinusoidal motion, e.g. for observation along 



A 

B 

A 



k\Z 




Fig. 1.5 A simple model of an atom 
as an electron that undergoes simple 
harmonic motion explains the features 
of the normal Zeeman effect of a mag- 
netic field (along the 2 -axis). The 
three eigenvectors of the motion are: 
e z cosu;ot and cos ({cjo ± ^l} t) e x ± 
sin ({cjq ±^ L }t) e y . 



34 

The matrix does not have off- 
diagonal elements in the last column 
or bottom row, so the x- and y- 
components are not coupled to the z- 
component, and the problem effectively 
reduces to solving a 2 x 2 matrix. 

35 Some further details are given in Sec- 
tion 2.2 and in Woodgate (1980). 

36 An oscillating electric dipole pro- 
portional to e z cosu)ot does not radi- 
ate along the 2 -axis — observation along 
this direction gives a view along the 
axis of the dipole so that effectively the 
motion of the electron cannot be seen. 
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Intensity (longitudinal) 




Intensity (transverse) 





Fig. 1.6 For the normal Zeeman effect a simple model of an atom (as in Fig. 1.5) explains the frequency of the light emitted 
and its polarization (indicated by the arrows for the cases of transverse and longitudinal observation). 



the x-axis only the ^/-component is seen, and the radiation is linearly 
polarized perpendicular to the magnetic field — see Fig. 1.6. These are 
called the cr-components and, in contrast to the 7r-component, they are 
also seen in longitudinal observation — looking along the z - axis one sees 
the electron’s circular motion and hence light that has circular polariza- 
tion. Looking in the opposite direction to the magnetic field (from the 
positive ^-direction, or 0 = 0 in polar coordinates) the circular motion 
37 This is left-circularly-polarized light in the anticlockwise direction is associated with the frequency cjo + ^l- 37 
(Corney 2000). In addition to showing that atoms contain electrons by measuring the 

magnitude of the charge-to-mass ratio e/m e , Zeeman also deduced the 
sign of the charge by considering the polarization of the emitted light. 
If the sign of the charge was not negative, as we assumed from the start, 
light at cjo + would have the opposite handedness — from this Zeeman 

could deduce the sign of the electron’s charge. 

For situations that only involve orbital angular momentum (and no 
spin) the predictions of this classical model correspond exactly to those 
of quantum mechanics (including the correct polarizations), and the in- 
tuition gained from this model gives useful guidance in more complicated 
cases. Another reason for studying the classical treatment of the Zee- 
man effect is that it furnishes an example of degenerate perturbation 
theory in classical mechanics. We shall encounter degenerate perturba- 
tion theory in quantum mechanics in several places in this book and an 
understanding of the analogous procedure in classical mechanics is very 
helpful. 



1.8 



The Zeeman effect 17 



1.8.1 Experimental observation of the Zeeman 
effect 

Figure 1.7(a) shows an apparatus suitable for the experimental observa- 
tion of the Zeeman effect and Fig. 1.7(b-e) shows some typical experi- 
mental traces. A low-pressure discharge lamp that contains the atom to 
be studied (e.g. helium or cadmium) is placed between the pole pieces 
of an electromagnet capable of producing fields of up to about 1 T. In 
the arrangement shown, a lens collects light emitted perpendicular to 
the field (transverse observation) and sends it through a Fabry-Perot 
etalon. The operation of such etalons is described in detail by Brooker 
(2003), and only a brief outline of the principle of operation is given 
here. 





Lamp 



Electromagnet 



Signal 




Fig. 1.7 (a) An apparatus suitable 
for the observation of the Zeeman ef- 
fect. The light emitted from a dis- 
charge lamp, between the pole pieces 
of the electromagnet, passes through 
a narrow-band filter and a Fabry- 
Perot etalon. Key: LI, L2 are lenses; 
F- filter; P- polarizer to discriminate 
between 7r- and cr-polarizations (op- 
tional); Fabry-Perot etalon made of 
a rigid spacer between two highly- 
reflecting mirrors (Ml and M2); D — 
detector. Other details can be found in 
Brooker (2003). A suitable procedure is 
to (partially) evacuate the etalon cham- 
ber and then allow air (or a gas with a 
higher refractive index such as carbon 
dioxide) to leak in through a constant- 
flow-rate valve to give a smooth linear 
scan. Plots (b) to (e) show the inten- 
sity / of light transmitted through the 
Fabry-Perot etalon. (b) A scan over 
two free-spectral ranges with no mag- 
netic field. Both (c) and (d) show a Zee- 
man pattern observed perpendicular to 
the applied field; the spacing between 
the 7r- and cr-components in these scans 
is one-quarter and one-third of the free- 
spectral range, respectively — the mag- 
netic field in scan (c) is weaker than 
in (d). (e) In longitudinal observation 
only the cr-components are observed — 
this scan is for the same field as in (c) 
and the cr-components have the same 
position in both traces. 
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38 

It equals the potential energy of the 
electron in the first Bohr orbit. 



39 

This Larmor frequency equals the 
splitting between the 7r- and cr- 
components in the normal Zeeman ef- 
fect. 



• Light from the lamp is collected by a lens and directed on to an 
interference filter that transmits only a narrow band of wavelengths 
corresponding to a single spectral line. 

• The etalon produces an interference pattern that has the form of con- 
centric rings. These rings are observed on a screen in the focal plane 
of the lens placed after the etalon. A small hole in the screen is po- 
sitioned at the centre of the pattern so that light in the region of the 
central fringe falls on a detector, e.g. a photodiode. (Alternatively, 
the lens and screen can be replaced by a camera that records the ring 
pattern on film.) 

• The effective optical path length between the two flat highly-reflecting 
mirrors is altered by changing the pressure of the air in the cham- 
ber; this scans the etalon over several free-spectral ranges while the 
intensity of the interference fringes is recorded to give traces as in 
Fig. 1.7(b-e). 



1.9 Summary of atomic units 



This chapter has used classical mechanics and elementary quantum ideas 
to introduce the important scales in atomic physics: the unit of length 
ao and a unit of energy hcR^. The natural unit of energy is e 2 /47reoao 
and this unit is called a hartree. 38 This book, however, expresses energy 
in terms of the energy equivalent to the Rydberg constant, 13.6 eV; this 
equals the binding energy in the first Bohr orbit of hydrogen, or 1/2 a 
hartree. These quantities have the following values: 

h? 

a 0 = TTU \ = 5.29 x 10" 11 m , (1.40) 

(e 2 /47re 0 ) m e 



HcRqq 



m e (e 2 /47re 0 ) 2 

2fr? 



13.6 eV. 



(1.41) 



The use of these atomic units makes the calculation of other quantities 
simple, e.g. the electric field in a hydrogen atom at radius r = ao equals 
e/(47reoao). This corresponds to a potential difference of 27.2 V over a 
distance of ao, or a field of 5 x 10 11 V m -1 . 

Relativistic effects depend on the dimensionless fine-structure con- 
stant a: 

(e 2 /47re 0 ) _ 1 
He ~ 137 ' 



a = 



(1.42) 



The Zeeman effect of a magnetic field on atoms leads to a frequency shift 
of in eqn 1.36. 39 In practical units the size of this frequency shift is 



2 TT# 



— ^ — = 14 GHz T _1 . 
47rm e 



(1.43) 



Equating the magnetic energy HQb with /^bB , the magnitude of the 
energy for a magnetic moment hb in a magnetic flux density F>, shows 
that the unit of atomic magnetic moment is the Bohr magneton 



eh 
2 m e 



= 9.27 x 10“ 24 JT -1 . 



Mb = 



(1.44) 
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This magnetic moment depends on the properties of the unpaired elec- 
tron (or electrons) in the atom, and has a similar magnitude for all 
atoms. In contrast, other atomic properties scale rapidly with the nu- 
clear charge; hydrogenic systems have energies proportional to Z 2 , and 
the same reasoning shows that their size is proportional to 1/Z (see 
eqns 1.40 and 1.41). For example, hydrogenic uranium U +91 has been 
produced in accelerators by stripping 91 electrons off a uranium atom 
to leave a single electron that has a binding energy of 92 2 x 13.6 eV = 
115keV (for n = 1) and an orbit of radius ao/92 = 5.75 x 10 -13 m = 
575 fm. The transitions between the lowest energy levels of this system 
have short wavelengths in the X-ray region. 40 

The reader might think that it would be a good idea to use the same 
units across the whole of atomic physics. In practice, however, the units 
reflect the actual experimental techniques used in a particular region 
of the spectrum, e.g. radio-frequency, or microwave synthesisers, are 
calibrated in Hz (kHz, MHz and GHz); the equation for the angle of 
diffraction from a grating is expressed in terms of a wavelength; and 
for X-rays produced by tubes in which electrons are accelerated by high 
voltages it is natural to use keV. 41 A table of useful conversion factors 
is given inside the back cover. 

The survey of classical ideas in this chapter gives a historical perspec- 
tive on the origins of atomic physics but it is not necessary, or indeed 
in some cases downright confusing, to go through a detailed classical 
treatment — the physics at the scale of atomic systems can only properly 
be described by wave mechanics and this is the approach used in the 
following chapters. 42 



40 Energies can be expressed in terms 
of the rest mass energy of the electron 
m e c 2 = 0.511 MeV. The gross energy 
is (Za) 2 ^m e c 2 and the fine structure 
is of order (Za) 4 |m e c 2 . 



41 Laser techniques can measure tran- 
sition frequencies of around 10 15 Hz 
directly as a frequency to determine 
a precise value of the Rydberg con- 
stant, and there are no definite rules for 
whether a transition should be specified 
by its energy, wavelength or frequency. 

42 

X-ray spectra are not discussed again 
in this book and further details can be 
found in Kuhn (1969) and other atomic 
physics texts. 



Exercises 



Isotope shift 

The deuteron has approximately twice the mass of 


H (nm) 


He+ (nm 


the proton. Calculate the difference in the wave- 
length of the Balmer-a line in hydrogen and deu- 


656.28 

486.13 


656.01 

541.16 


terium. 


434.05 


485.93 


The energy levels of one- electron atoms 


410.17 


454.16 


The table gives the wavelength 43 of lines observed 
in the spectrum of atomic hydrogen and singly- 
ionized helium. Explain as fully as possible the 
similarities and differences between the two spec- 
tra. 




433.87 

419.99 

410.00 



43 These are the wavelengths in air with a refractive index of 1.0003 in the visible region. 
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(1.3) Relativistic effects 

Evaluate the magnitude of relativistic effects in 
the n — 2 level of hydrogen. What is the resolv- 
ing power A/ (A A) min of an instrument that could 
observe these effects in the Balmer-o line? 

(1.4) X-rays 

Show that eqn 1.21 approximates to eqn 1.20 when 
the atomic number Z is much greater than the 
screening factors. 

(1.5) X-rays 

It is suspected that manganese (Z — 25) is very 
poorly mixed with iron (Z = 26) in a block of al- 
loy. Predict the energies of the K-absorption edges 
of these elements and determine an X-ray photon 
energy that would give good contrast (between re- 
gions of different concentrations) in an X-ray of 
the block. 

(1.6) X-ray experiments 

Sketch an apparatus suitable for X-ray spectro- 
scopy of elements, e.g. Moseley’s experiment. 
Describe the principle of its operation and the 
method of measuring the energy, or wavelength, 
of X-rays. 

(1.7) Fine structure in X-ray transitions 

Estimate the magnitude of the relativistic effects 
in the L-shell of lead (Z = 82) in keV. Also express 
you answer as a fraction of the K a transition. 

(1.8) Radiative lifetime 

For an electron in a circular orbit of radius r 
the electric dipole moment has a magnitude of 
D — —er and radiates energy at a rate given by 
eqn 1.22. Find the time taken to lose an energy of 
hi o. 

Use your expression to estimate the transition rate 
for the n — 3 to n = 2 transition in hydrogen that 
emits light of wavelength 656 nm. 

Comment. This method gives 1/r oc (er) 2 o; 3 , 
which corresponds closely to the quantum mechan- 
ical result in eqn 7.23. 

(1.9) Black-body radiation 

Two- level atoms with a transition at wavelength 
A = 600 nm, between the levels with degeneracies 
gi = 1 and #2 = 3, are immersed in black-body 
radiation. The fraction in the excited state is 0.1. 
What is the temperature of the black body and the 
energy density per unit frequency interval p(uj 12 ) 
of the radiation at the transition frequency? 

(1.10) Zeeman effect 

What is the magnitude of the Zeeman shift for an 
atom in (a) the Earth’s magnetic field, and (b) a 



magnetic flux density of 1 T? Express your answers 
in both MHz, and as a fraction of the transition 
frequency A/// for a spectral line in the visible. 

(1.11) Relative intensities in the Zeeman effect 
Without an external field, an atom has no pre- 
ferred direction and the choice of quantisation axis 
is arbitrary. In these circumstances the light emit- 
ted cannot be polarized (since this would establish 
a preferred orientation). As a magnetic field is 
gradually turned on we do not expect the intensi- 
ties of the different Zeeman components to change 
discontinuously because the field has little effect 
on transition rates. This physical argument im- 
plies that oppositely-polarized components emit- 
ted along the direction of the field must have equal 
intensities, i.e. I a + = I a - (notation defined in 
Fig. 1.6). What can you deduce about 

(a) the relative intensities of the components 
emitted perpendicularly to the field? 

(b) the ratio of the total intensities of light emit- 
ted along and perpendicular to the field? 

(1.12) Bohr theory and the correspondence principle 
This exercise gives an alternative approach to the 
theory of the hydrogen atom presented in Sec- 
tion 1.3 that is close to the spirit of Bohr’s original 
papers. It is somewhat more subtle than that usu- 
ally given in elementary textbooks and illustrates 
Bohr’s great intuition. Rather than the ad hoc as- 
sumption that angular momentum is an integral 
multiple of h (in eqn 1.7), Bohr used the corre- 
spondence principle. This principle relates the be- 
haviour of a system according to the known laws 
of classical mechanics and its quantum properties. 

Assumption II The correspondence principle 
states that in the limit of large quantum numbers 
a quantum system tends to the same limit as the 
corresponding classical system. 

Bohr formulated this principle in the early days 
of quantum theory. To apply this principle to hy- 
drogen we first calculate the energy gap between 
adjacent electron orbits of radii r and r' . For large 
radii, the change A r = r — r <C r. 

(a) Show that the angular frequency uj = A E/h 
of radiation emitted when an electron makes 
a quantum jump between these levels is 

e 2 /47reo A r 

(jj ^ — . 

- 2 h r 2 

(b) An electron moving in a circle of radius r acts 
as an electric dipole radiating energy at the 
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orbital frequency c o given by eqn 1.4. Verify 
that this equation follows from eqn 1.3. 

(c) In the limit of large quantum numbers, the 
quantum mechanical and classical expressions 
give the same frequency uj. Show that equat- 
ing the expressions in the previous parts yields 
Ar = 2 (a 0 r) 1/2 . 

(d) The difference in the radii between two ad- 
jacent orbits can be expressed as a difference 
equation. 44 In this case An = 1 and 



This equation can be solved by assuming that 
the radius varies as some power x of the quan- 
tum number n, e.g. if one orbit is labelled 
by an integer n and the next by n + 1, then 
r — an x and r' — a (n + l) x . Show that 
A r = axn x ~ x oc n x ^ 2 . Determine the power x 
and the constant a. 

Web site: 

http : //www. physics . ox. ac .uk/users/f oot 

This site has answers to some of the exercises, 



Comment. We have found eqn 1.8 from the cor- 
respondence principle without considering angular 
momentum. The allowed energy levels are easily 
found from this equation as in Section 1.3. The re- 
markable feature is that, although the form of the 
equation was derived for high values of the prin- 
cipal quantum number, the result works down to 
n — 1. 

(1.13) Rydberg atoms 

(a) Show that the energy of the transitions be- 
tween two shells with principal quantum num- 
bers n and n = n + 1 is proportional to 1/n 3 
for large n. 

(b) Calculate the frequency of the transition be- 
tween the n — 51 and n = 50 shells of a 
neutral atom. 

(c) What is the size of an atom in these Rydberg 
states ? Express your answer both in atomic 
units and in metres. 



corrections and other supplementary information. 



44 A difference equation is akin to a differential equation but without letting the differences become infinitesimal. 
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The simple hydrogen atom has had a great influence on the development 
of quantum theory, particularly in the first half of the twentieth century 
when the foundations of quantum mechanics were laid. As measurement 
techniques improved, finer and finer details were resolved in the spec- 
trum of hydrogen until eventually splittings of the lines were observed 
that cannot be explained even by the fully relativistic formulation of 
quantum mechanics, but require the more advanced theory of quantum 
electrodynamics. In the first chapter we looked at the Bohr-Sommerfeld 
theory of hydrogen that treated the electron orbits classically and im- 
posed quantisation rules upon them. This theory accounted for many of 
the features of hydrogen but it fails to provide a realistic description of 
systems with more than one electron, e.g. the helium atom. Although 
the simple picture of electrons orbiting the nucleus, like planets round 
the sun, can explain some phenomena, it has been superseded by the 
Schrodinger equation and wavefunctions. This chapter outlines the ap- 
plication of this approach to solve Schrodinger’s equation for the hydro- 
gen atom; this leads to the same energy levels as the Bohr model but 
the wavefunctions give much more information, e.g. they allow the rates 
of the transitions between levels to be calculated (see Chapter 7). This 
chapter also shows how the perturbations caused by relativistic effects 
lead to fine structure. 



2.1 The Schrodinger equation 



The solution of the Schrodinger equation for a Coulomb potential is 
in every quantum mechanics textbook and only a brief outline is given 
x The emphasis is on the properties of here. 1 The Schrodinger equation for an electron of mass m e in a 
the wavefunctions rather than how to spherically-symmetric potential is 
solve differential equations. 



2 

The operator for linear momentum is 
p = — iftV and for angular momentum 
it is h\ = r x p. This notation differs in 
two ways from that commonly used in 
quantum texts. Firstly, h is taken out- 
side the angular momentum operators, 
and secondly, the operators are written 
without ‘hats’. This is convenient for 
atomic physics, e.g. in the vector model 
for the addition of angular momenta. 



{£r +nr) V= E *- (2 - i) 

This is the quantum mechanical counterpart of the classical equation 
for the conservation of total energy expressed as the sum of kinetic and 
potential energies. 2 In spherical polar coordinates we have 



V 2 = 



(r 2 —) - — l 2 , 

r 2 dr \ dr J r 2 ’ 



( 2 . 2 ) 
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where the operator l 2 contains the terms that depend on 0 and 0, namely 
1 d 



V = - 



sin 6 dQ 



. „ d\ 1 d 2 

Sm 9 00 ) + sin 2 6 dcj) 2 



(2-3) 



and h 2 1 2 is the operator for the orbital angular momentum squared. Fol- 
lowing the usual procedure for solving partial differential equations, we 
look for a solution in the form of a product of functions ij; = R(r)Y (0, </>). 
The equation separates into radial and angular parts as follows: 



IA ( r 2 d Jl\ 

Rdr \ dr J 



2 m e r 2 

H 2 



{V(r) - E} = yl 2 Y . 



(2.4) 



Each side depends on different variables and so the equation is only 
satisfied if both sides equal a constant that we call b. Thus 



l 2 y = bY. 



(2.5) 



This is an eigenvalue equation and we shall use the quantum theory of 
angular momentum operators to determine the eigenfunctions F(0, 0). 



2.1.1 Solution of the angular equation 

To continue the separation of variables we substitute Y = 0(0) <£>(</>) into 
eqn 2.5 to obtain 



sin 0 d 
~Q~dO 




+ b sin 2 0 



1 < 9 2 $ 
$ S02 



= const . 



(2.6) 



The equation for <F(0) is the same as in simple harmonic motion, so 3 



$ = Ae im0 + Be ~ irn(t> . 



(2.7) 



The constant on the right-hand side of eqn 2.6 has the value m 2 . Phys- 
ically realistic wavefunctions have a unique value at each point and this 
imposes the condition <F(</> + 27 r) = <£>(</>), so m must be an integer. 
The function <F(0) is the sum of eigenfunctions of the operator for the 
^-component of orbital angular momentum 

d 

hl z = — i h— . (2.8) 



The function e 1171 ^ has magnetic quantum number m and its complex 
conjugate has magnetic quantum number — m. 4 

A convenient way to find the function Y ( 0 , 0) and its eigenvalue b in 
eqn 2.5 5 is to use the ladder operators IM\= l x + i l y and /_ = l x — il y - 
These operators commute with l 2 , the operator for the total angular 
momentum squared (because l x and l y commute with l 2 ); therefore, the 
three functions F, Z+F and Z_F are all eigenfunctions of l 2 with the 
same eigenvalue b (if they are non-zero, as discussed below). The ladder 



3 

A and B are arbitrary constants. 
Alternatively, the solutions can be 
written in terms of real functions as 
A' sin(mb) + B' cos 



4 The operator —d 2 /d(f) 2 = l 2 and con- 
sequently <£((/>) is an eigenfunction of l 2 
with eigenvalue m 2 . 

The solution of equations involving 
the angular part of V 2 arises in many 
situations with spherical symmetry, e.g. 
in electrostatics, and the same mathe- 
matical tools could be used here to de- 
termine the properties of the spherical 
harmonic functions, but angular mo- 
mentum methods give more physical in- 
sight for atoms. 
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6 The raising operator contains the fac- 
tor e 1 ^ , so that when it acts on an eigen- 
function of the form Y oc ©(#)e im< ^ 
the resulting function l+Y contains 
e i(m+i)0_ The ^-dependent part of this 
function is found below. 

These properties follow from the com- 
mutation relations for angular momen- 
tum operators (see Exercise 2.1). 

8 

This statement can be proved rigor- 
ously using angular momentum opera- 
tors, as shown in quantum mechanics 
texts. 

9 The dubious reader can easily check 
that Z+Y m = 0. It is trivially obvious 
that l z Yi^i — l Y^i, where m = l for this 
function. 

Yy-i = 0 and m = — 1 is the low- 
est eigenvalue of l z . Proportional signs 
have been used to avoid worrying about 
normalisation; this leaves an ambiguity 
about the relative phases of the eigen- 
functions but we shall choose them in 
accordance with usual convention. 

11 The relation Yq_ m = shows 

that, if m ma x = l, then m m - in = —l. 
Between these two extremes there are 
21 + 1 possible values of the magnetic 
quantum number m for each l. Note 
that the orbital angular momentum 
quantum number l is not the same as 
the length of the angular momentum 
vector (in units of K). Quantum me- 
chanics tells us only that the expecta- 
tion value of the square of the orbital 
angular momentum is 1(1 + 1), in units 
of h 2 . The length itself does not have a 
well-defined value in quantum mechan- 
ics and it does not make sense to re- 
fer to it. When people say that an 
atom has ‘orbital angular momentum 
of one, two, etc.’, strictly speaking they 
mean that the orbital angular momen- 
tum quantum number l is 1, 2, etc. 



operators can be expressed in polar coordinates as: 

4=e ‘*(^ +icote ^)' 



(2 9) 



The operator Z + transforms a function with magnetic quantum number 
m into another angular momentum eigenfunction that has eigenvalue 
m+l. Thus Z + is called the raising operator . 6 The lowering operator Z_ 
changes the magnetic quantum number in the other direction, m — > m — 
1. It is straightforward to prove these statements and other properties 
of these operators; 7 however, the purpose of this section is not to present 
the general theory of angular momentum but simply to outline how to 
find the eigenfunctions (of the angular part) of the Schrodinger equation. 

Repeated application of the raising operator does not increase m 
indefinitely — for each eigenvalue b there is a maximum value of the mag- 
netic quantum number 8 that we shall call /, i.e. ra max = l. The raising 
operator acting on an eigenfunction with m max gives zero since by def- 
inition there are no eigenfunctions with m > ra max . Thus solving the 
equation l+Y = 0 (Exercise 2.11) we find that the eigenfunctions with 
m max = l have the form 

Y oc sm l Qe il< f > . (2.10) 

Substitution back into eqn 2.5 shows that these are eigenfunctions l 2 with 
eigenvalue b = l (l + 1), and l is the orbital angular momentum quantum 
number. The functions T/ ?m (6 > , 0) are labelled by their eigenvalues in the 
conventional way. 9 For l = 0 only mm 0 exists and Yo,o is a constant 
with no angular dependence. For l = 1 we can find the eigenfunctions 
by starting from the one with l = 1 = m (in eqn 2.10) and using the 
lowering operator to find the others: 



Yi,i ocsinfle^, 

Yyo oc /_ Yqi oc cos 0 , 
Fi,_i(x/_Fi,oocsin<9e-^. 

This gives all three eigenfunctions expected for l = l. 10 For l = 2 this 
procedure gives 

12,2 oc sin 2 0e l2 ^, 

12,-2 oc sin^e -12 ^. 

These are the five eigenfunctions with m = 2,1,0,— 1,-2. 11 Normalised 
angular functions are given in Table 2.1. 

Any angular momentum eigenstate can be found from eqn 2.10 by 
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Table 2.1 Orbital angular momentum eigenfunctions. 




repeated application of the lowering operator: 12 



Y l>m oc sm l 0e il *. 



12 

This eigenfunction has magnetic 
quantum number l — (l — m) = m. 

( 2 . 11 ) 



To understand the properties of atoms, it is important to know what 
the wavefunctions look like. The angular distribution needs to be mul- 
tiplied by the radial distribution, calculated in the next section, to give 
the square of the wavefunction as 



|</> (r,9,cf>)\ 2 =Rl l (r)\Y l , m (6,cl>)\ 2 . (2.12) 



This is the probability distribution of the electron, or — e \^p\ 2 can be in- 
terpreted as the electronic charge distribution. Many atomic properties, 
however, depend mainly on the form of the angular distribution and 
Fig. 2.1 shows some plots of |Y/ ?m | 2 . The function |Yo,o| 2 is spherically 
symmetric. The function |Yi ? o| 2 has two lobes along the 2 )- axis. The 
squared modulus of the other two eigenfunctions of l = 1 is proportional 
to sin 2 6. As shown in Fig. 2.1(c), there is a correspondence between 
these distributions and the circular motion of the electron around the 
z - axis that we found as the normal modes in the classical theory of the 
Zeeman effect (in Chapter l). 13 This can be seen in Cartesian coordi- 
nates where 



*i,o oc - , 
r 



Fi,KX 
Fi,-! oc 



X + iy 
r 

x-iy 



(2.13) 



13 

Stationary states in quantum 
mechanics correspond to the time- 
averaged classical motion. In this 
case both directions of circular mo- 
tion about the a?-axis give the same 
distribution. 
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Fig. 2.1 Polar plots of the squared modulus of the angular wavefunctions for the hydrogen atom with l = 0 and 1. For each 
value of the polar angle 6 a point is plotted at a distance proportional to \Y(6,cf))\ 2 from the origin. Except for (d), the plots 
have rotational symmetry about the z-axis and look the same for any value of </>. (a) |Yo,o| 2 is spherical, (b) |Yi 5 o| 2 oc cos 2 6 
has two lobes along the z- axis, (c) | Vi , 1 1 2 oc sin 2 6 has an ‘almost’ toroidal shape — this function equals zero for 0 = 0. (|Yi,-i | 2 
looks the same.) (d) lYi^f — Yiq—il 2 oc | cc/r* | 2 has rotational symmetry about the £-axis and this polar plot is drawn for <j> = 0; 
it looks like (b) but rotated through an angle of 7r/2. (e) |Y2,2 1 2 oc sin 4 9. 



Any linear combination of these is also an eigenfunction of l 2 , e.g. 



14 In the absence of an external field 
to break the spherical symmetry, all 
axes are equivalent, i.e. the atom does 
not have a preferred direction so there 
is symmetry between the x- : y- and 
^-directions. In an external magnetic 
field the states with different values of 
m (but the same l) are not degenerate 
and so linear combinations of them are 
not eigenstates of the system. 



Y\ _i — Y\ i oc — = sin# cos 0 , (2.14) 

r 

Y\ _i + Y\ i oc - == sin# sin0 . (2.15) 

r 

These two real functions have the same shape as Yyo oc z/r but are 
aligned along the x- and y- axes, respectively. 14 In chemistry these dis- 
tributions for / = 1 are referred to as p-orbitals. Computer programs 
can produce plots of such functions from any desired viewing angle (see 
Blundell 2001, Fig. 3.1) that are helpful in visualising the functions with 
l > 1. (For l = 0 and 1 a cross-section of the functions in a plane that 
contains the symmetry axis suffices.) 



2.1.2 Solution of the radial equation 

An equation for R(r) is obtained by setting eqn 2.4 equal to the constant 
b = 1(1 + 1) and putting in the Coulomb potential V(r) = — e 2 /47reo r. It 
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can be cast in a convenient form by the substitution P (r) = rR (r) : 



_tf_d 2 _P_ \ h 2 l {l Pi) 

2 m e dr 2 \ 2 m e r 2 




P = 0. 



(2.16) 



The term proportional to /(/ + l)/r 2 is the kinetic energy associated 
with the angular degrees of freedom; it appears in this radial equation 
as an effective potential that tends to keep wavefunctions with l ^ 0 
away from the origin. Dividing through this equation by E = —\E\ 
(a negative quantity since E ^ 0 for a bound state) and making the 
substitution 

2 2 m e \E\ r 

^ h 2 

reduces the equation to the dimensionless form 



(2.17) 



d 2 P f 1(1 + 1) A 

dp 2 \ p 2 p 



P = 0. 



(2.18) 



The constant that characterises the Coulomb interaction strength is 



e 2 I 2 m e 



(2.19) 



The standard method of solving such differential equations is to look for 
a solution in the form of a series. The series solutions have a finite num- 
ber of terms and do not diverge when A = 2 n, where n is an integer. 15 
Thus, from eqn 2.19, these wavefunctions have eigenenergies given by 16 



E = 



2m e (e 2 /47re 0 ) 2 1 



h 2 



A 2 



= —hcRry 



( 2 . 20 ) 



This shows that the Schrodinger equation has stationary solutions at en- 
ergies given by the Bohr formula. The energy does not depend on /; this 
accidental degeneracy of wavefunctions with different l is a special fea- 
ture of Coulomb potential. In contrast, degeneracy with respect to the 
magnetic quantum number mi arises because of the system’s symmetry, 
i.e. an atom’s properties are independent of its orientation in space, in 
the absence of external fields. 17 The solution of the Schrodinger equation 
gives much more information than just the energies; from the wavefunc- 
tions we can calculate other atomic properties in ways that were not 
possible in the Bohr-Sommerfeld theory. 

We have not gone through the gory details of the series solution, but 
we should examine a few examples of radial wavefunctions (see Ta- 
ble 2.2). Although the energy depends only on n, the shape of the 
wavefunctions depends on both n and l and these two quantum num- 
bers are used to label the radial functions R n g(r). For n = 1 there is 
only the l = 0 solution, namely i7qo oc e -p . For n = 2 the orbital 
angular momentum quantum number is l = 0 or 1, giving 



15 The solution has the general form 
P (p) = Ce~ p v (p), where v(p) is an- 
other function of the radial coordinate, 
for which there is a polynomial solution 
(see Woodgate 1980 and Rae 1992). 

16 Using eqn 1.41. 



IT 

This is true for any spherically- 
symmetric potential V(r). 



# 2,0 oc (1 - p)e p , 
# 2,1 oc pe~ p . 



28 The hydrogen atom 



Table 2.2 Radial hydrogenic wavefunctions R n ,i in terms of the variable p = 
Zr/{nao ), which gives a scaling that varies with n. The Bohr radius ao is defined in 
eqn 1.40. 




These show a general a feature of hydrogenic wavefunctions, namely 
that the radial functions for l = 0 have a finite value at the origin, i.e. 
the power series in p starts at the zeroth power. Thus electrons with 
l = 0 (called s-electrons) have a finite probability of being found at the 
position of the nucleus and this has important consequences in atomic 
physics. 

Inserting \E\ from eqn 2.20 into eqn 2.17 gives the scaled coordinate 



P 



Z r 
n ao ’ 



( 2 . 21 ) 



where the atomic number has been incorporated by the replacement 
e 2 /47reo — > Ze 2 / 47reo (as in Chapter 1). There are some important prop- 
erties of the radial wavefunctions that require a general form of the 
solution and for future reference we state these results. The probability 
density of electrons with l = 0 at the origin is 



|Vv=o (0)| 2 




For electrons with l ^ 0 the expectation value of 1/r 3 is 




1 



Rl,i ( r ) r 2 dr 



1 

d* + £)(* + !) 




( 2 . 22 ) 



(2.23) 



These results have been written in a form that is easy to remember; 
they must both depend on 1/clq in order to have the correct dimensions 
and the dependence on Z follows from the scaling of the Schrodinger 
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equation. The dependence on the principal quantum number n also 
seems to follow from eqn 2.21 but this is coincidental; a counterexample 



1 

r 



1 




(2.24) 
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The wavefunction solutions of the Schrodinger equation for particular 
energies are standing waves and give a distribution of electronic charge 
— e | ip (r)| 2 that is constant in time. We shall now consider how transi- 
tions between these stationary states occur when the atom interacts with 
electromagnetic radiation that produces an oscillating electric field 19 



E(i) = |E 0 |Re(e- iut e rad ) (2.25) 



with constant amplitude |E 0 | and polarization vector e ra d. 20 If to lies 
close to the atomic resonance frequency then the perturbing electric 
field puts the atom into a superposition of different states and induces 
an oscillating electric dipole moment on the atom (see Exercise 2.10). 
The calculation of the stimulated transition rate requires time-dependent 
perturbation theory (TDPT), as described in Chapter 7. However, the 
treatment from first principles is lengthy and we shall anticipate some 
of the results so that we can see how spectra relate to the underlying 
structure of the atomic energy levels. This does not require an exact 
calculation of transition rates, but we only need to determine whether 
the transition rate has a finite value or whether it is zero (to first order), 
i.e. whether the transition is allowed and gives a strong spectral line, or 
is forbidden. 

The result of time-dependent perturbation theory is encapsulated in 
the golden rule (or Fermi’s golden rule); 21 this states that the rate of 
transitions is proportional to the square of the matrix element of the 
perturbation. The Hamiltonian that describes the time-dependent in- 
teraction with the field in eqn 2.25 is H' = er • E (£), where the electric 
dipole operator is — er. 22 This interaction with the radiation stimulates 
transitions from state 1 to state 2 at a rate 23 



Rate oc |eEo 




V>2 (r-erad)V’i d 3 r 



2 



|eE 0 | 2 x |(2|r -e rad |1)| 2 . 



(2.26) 

The concise expression in Dirac notation is convenient for later use. This 
treatment assumes that the amplitude of the electric field is uniform over 
the atom so that it can be taken outside the integral over the atomic 
wavef unctions, i.e. that Eo does not depend on r. 24 We write the dipole 
matrix element as the product 



(2|r-e rad |l ) = D 12 l ang . (2.27) 



The radial integral is 25 



1 8 

This quantity is related to the quan- 
tum mechanical expectation value of 
the potential energy (p.e.); as in the 
Bohr model the total energy is E = 
(p.e.) /2. 



The interaction of atoms with the os- 
cillating magnetic field in such a wave is 
considerably weaker; see Appendix C. 



20 /s. 

The unit vector e rad gives the direc- 
tion of the oscillating electric field. For 
example, for the simple case of linear 
polarization along the x-dods e rad = e x 
and the real part of e~ lujt is cos (cut); 
therefore E (t) = |Eq | cos(cut) e x . 



21 

See quantum mechanics texts such as 
Mandl (1992). 

22 

This is analogous to the interaction 
of a classical dipole with an electric 
field. Atoms do not have a perma- 
nent dipole moment, but one is induced 
by the oscillating electric field. For a 
more rigorous derivation, see Woodgate 
(1980) or Loudon (2000). 

23 

The maximum transition rate occurs 
when cu, the frequency of the radiation, 
matches the transition frequency cu± 2 , 
as discussed in Chapter 7. Note, how- 
ever, that we shall not discuss the so- 
called ‘density of states’ in the golden 
rule since this is not straightforward for 
monochromatic radiation. 

24 

In eqn 2.25 the phase of the wave is 
actually (cut — k • r), where r is the co- 
ordinate relative to the atom’s centre 
of mass (taken to be the origin) and k 
is the wavevector. We assume that the 
variation of phase k • r is small over the 
atom (kao <C 27 r). This is equivalent to 
A ao, i.e. the radiation has a wave- 
length much greater than the size of the 
atom. This is called the dipole approx- 
imation. 

25 Note that D 12 = D 21 . 
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D 12 = / Rn 2 ,h (r) rR nuh (r)r 2 dr. (2.28) 

Jo 

The angular integral is 

/* 277 /*7T 

Yn g = / / Vi*, ma (0, 0) f • e rad y, 1>mi (0, </>) sin 0 d0 , (2.29) 

JO Jo 

where r = r/r. The radial integral is not normally zero although it can 
be small for transitions between states whose radial wavefunctions have 
a small overlap, e.g. when n\ is small and n 2 is large (or the other way 
round). In contrast, the X ang = 0 unless strict criteria are satisfied — 
these are the selection rules. 



2.2.1 Selection rules 



26 

If either the atoms have random ori- 
entations (e.g. because there is no ex- 
ternal field) or the radiation is unpo- 
larized (or both), then an average over 
all angles must be made at the end of 
the calculation. 



The selection rules that govern allowed transitions arise from the angular 
integral in eqn 2.29 which contains the angular dependence of the inter- 
action r • e ra d for a given polarization of the radiation. The mathematics 
requires that we calculate X ang for an atom with a well-defined quanti- 
sation axis (invariably chosen to be the z-axis) and radiation that has a 
well-defined polarization and direction of propagation. This corresponds 
to the physical situation of an atom experiencing the Zeeman effect of an 
external magnetic field, as described in Section 1.8; that treatment of the 
electron as a classical oscillator showed that the components of differ- 
ent frequencies within the Zeeman pattern have different polarizations. 
We use the same nomenclature of 7 r- and cr-transitions here; transverse 
observation refers to radiation emitted perpendicular to the magnetic 
field, and longitudinal observation is along the z-axis. 26 

To calculate Z ang we write the unit vector ? in the direction of the 
induced dipole as: 

^ 1 , ^ 

r = - ( xe x +ye y + ze z ) 

= sin 6 cos (j) e x + sin # sin y) e y + cos^e^ . (2.30) 



Expressing the functions of 0 and cj> in terms of spherical harmonic func- 
tions as 

I 2tt 

sin 0 cos <p = \ — (Yi -1 - Yi,i) , 



leads to 



/ 27 r 

sin 9 sin <p = iy — (Y,-i + Y,i) , 

1 47T 

cos 6 = \ — Y , 0 , 



— -rr e x ~r ie y ^ —e x + ie y 

r oc Y- 1 + Y,oe z + Y,i 



1 J2 

We write the general polarization vector as 



®rad — A a - 



G X 1^?/ A . . 

h A w e z + A a 



V2 



V2 



6 x T i 



V2 



(2.31) 



(2.32) 

(2.33) 
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where A n depends on the component of the electric field along the z- 
axis and the component in the xy- plane is written as a superposition of 
two circular polarizations with amplitudes A a + and A a - (rather than 
in terms of linear polarization in a Cartesian basis). 27 Similarly, the 
classical motion of the electron was written in terms of three eigenvectors 
in Section 1.8: an oscillation along the z-axis and circular motion in the 
xy- plane, both clockwise and anticlockwise. 

From the expression for r in terms of the angular functions Yi jm (Q, (f)) 
with Z = 1 we find that the dipole induced on the atom is proportional 
to 28 

F ' e rad Aj-^1,-1 + A ^1,0 + A+^1, + 1 • (2.34) 

The following sections consider the transitions that arise from these three 
terms. 29 

7r-transitions 

The component of the electric field along the 2 - axis A z induces a dipole 
moment on the atom proportional to e ra d • e z = cos 0 and the integral 
over the angular parts of the wavefunctions is 

p2tt pTT 

/ Y l i m2 (e,cj>)coseY lumi (e,cp)smededcp. (2.35) 
Jo Jo 

To determine this integral we exploit the symmetry with respect to ro- 
tations about the z-axis. 30 The system has cylindrical symmetry, so the 
value of this integral is unchanged by a rotation about the z- axis through 
an angle </> 0 : 

2I„ g = . (2.36) 

This equation is satisfied if either T£ ng = 0 or raq = m/ 2 . For this 
polarization the magnetic quantum number does not change, A mi = 0. 31 

cr-transitions 

The component of the oscillating electric field in the xy- plane excites cr- 
transitions. Equation 2.34 shows that the circularly-polarized radiation 
with amplitude A a + excites an oscillating dipole moment on the atom 
proportional Yyi oc sin^e 1 ^, for which the angular integral is 

p2tT p7 T 

= / / y h,m 2 (0, 0) Sin 9 Y lumi (9, <j>) sin 9 d9 d </> . (2.37) 

Jo Jo 

Again, consideration of symmetry with respect to rotation about the z- 
axis through an arbitrary angle shows that X^ ng = 0 unless raq — mi 2 + 
1 = 0. The interaction of an atom with circularly-polarized radiation of 
the opposite handedness leads to a similar integral but with e 1 ^ —> e~ u 
this integral Z^ ng = 0 unless mq — mi 2 — 1 = 0. Thus the selection rule 
for the cr-transitions is A mi = ±1. 

We have found the selection rules that govern A mi for each of the 
three possible polarizations of the radiation separately. These apply 
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We will see that the labels 7 r, cr+ and 
a refer to the transition that the radi- 
ation excites; for this it is only impor- 
tant to know how the electric field be- 
haves at the position of the atom. The 
polarization state associated with this 
electric field, e.g. whether it is right- 
or left-handed circularly-polarized radi- 
ation, also depends on the direction of 
propagation (wavevector) , but we shall 
try to avoid a detailed treatment of the 
polarization conventions in this discus- 
sion of the principles. Clearly, however, 
it is important to have the correct po- 
larization when setting up actual exper- 
iments. 

28 

The eigenvectors have the following 
properties: 

e x + ie y e x — ie y _ 

— - 

and 

e x db ie y e x ± ie^ _ n 

29 

In spherical tensor notation 
(Woodgate 1980) the three vector 
components are written A— i, Ao and 
A+ 1, which is convenient for more 
general use; but writing eqn 2.34 as 
given emphasises that the amplitudes 
A represent the different polarizations 
of the radiation and the spherical 
harmonics come from the atomic 
response (induced dipole moment). 

30 

Alternative methods are given below 
and in Exercise 2.9. 

31 

We use mi to distinguish this quan- 
tum number from m s , the magnetic 
quantum number for spin angular mo- 
mentum that is introduced later. Spe- 
cific functions of the spatial variables 
such as Yi m and do not need 

this additional subscript. 
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when the polarized light interacts with an atom that has a well-defined 
orientation, e.g. an atom in an external magnetic field. If the light 
is unpolarized or there is no defined quantisation axis, or both, then 
A mi = 0, =bl. 



32 

Similar behaviour arises in the clas- 
sical model of the normal Zeeman ef- 
fect in Section 1.8, but the quantum 
treatment in this section shows that 
it is a general feature of longitudinal 
observation — not just for the normal 
Zeeman effect. 



Example 2.1 Longitudinal observation 

Electromagnetic radiation is a transverse wave with its oscillating elec- 
tric field perpendicular to the direction of propagation, e ra d • k = 0. 
Thus radiation with wavevector k = ke z has A z = 0 and ^-transitions 
do not occur. 32 Circularly-polarized radiation (propagating along the z- 
axis) is a special case for which transitions occur with either A mi = +1 
or A mi = — 1, depending on the handedness of the radiation, but not 
both. 



2.2.2 Integration with respect to 6 

In the angular integral the spherical harmonic functions with l = 1 (from 
eqn 2.34) are sandwiched between the angular momentum wavefunctions 
of the initial and final states so that 



^ang 



/7 



hum 



Y htmi sin 9d6 d(f> . 



(2.38) 



See the references on angular mo- 
mentum in quantum mechanics; the 
reason why the magnetic quantum 
numbers add is obvious from <f>( </ !>). 



To calculate this angular integral we use the following formula: 33 

Id, m Iq, mi = A Y/i_i_i 5 rn\-\-m T B Tq — l ? mi+ra , (2.39) 



34 We have sinO d0 d</> 

— fy,z $m' ,m- This reduces to the nor- 
malisation in Table 2.1 when l' = l and 
m' = m. 



where A and B are constants whose exact values need not concern us. 
Thus from the orthogonality of the spherical harmonics 34 we find 

-^ang A $l2,li + l$rri2,rni-\-m T B — 



35 This argument applies only for elec- 
tric dipole radiation. Higher-order 
terms, e.g. quadrupole radiation, can 
give Al > 1. 



The delta functions give the selection rule found previously, namely 
A mi = m, where m = 0, ±1 depending on the polarization, and also 
Al = ±1. In the mathematics the functions with l = 1 that represent 
the interaction with the radiation are sandwiched between the orbital 
angular momentum eigenfunctions of the initial and final states. Thus 
the rule Al = ±1 can be interpreted as conservation of angular momen- 
tum for a photon carrying one unit of angular momentum, h (Fig. 2.8 
illustrates this reasoning for the case of total angular momentum). 35 The 
changes in the magnetic quantum number are also consistent with this 
picture^ — the component of the photon’s angular momentum along the 
z - axis being Ami = 0, ±1. Conservation of angular momentum does not 
explain why Al ^ 0 — this comes about because of parity, as explained 
below. 



2.2.3 Parity 

Parity is an important symmetry property throughout atomic and molec- 
ular physics and its general use will be explained before applying it to 
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selection rules. The parity transformation is an inversion through the 
origin given by r — v — r. This is equivalent to the following transforma- 
tion of the polar coordinates: 

0 — » 7 t — 6 : a reflection , 

<p — > (p + 7r : a rotation . 

The reflection produces a mirror image of the original system and parity 
is also referred to as mirror symmetry. The mirror image of a hydrogen 
atom has the same energy levels as those in the original atom since the 
Coulomb potential is the same after reflection. It turns out that all the 
electric and magnetic interactions ‘look the same’ after reflection and 
all atoms have parity symmetry. 36 To find the eigenvalues for parity we 
use the full quantum mechanical notation, with hats to distinguish the 
operator P from its eigenvalue P in the equation 

Pip = P%p, (2.40) 

from which it follows that P 2 'ip = P 2 ip. Two successive parity operations 
correspond to there being no change (the identity operator), i.e. r — > 
— r — > r. Thus P 2 — 1. Therefore the parity operator has eigenvalues 
P = 1 and —1 that correspond to even and odd parity wavefunctions, 
respectively: 

P pj = 'ip or P pj = — 'ip . 

Both eigenvalues occur for the spherical harmonic functions, 

Plj,m = (-1) l Y Um . (2.41) 

The value of the angular integral does not change in a parity trans- 
formation 37 so 

= (~1) h+h + 1 ■ (2.42) 

Thus the integral is zero unless the initial and final states have opposite 
parity (see Exercise 2.12). In particular, electric dipole transitions re- 
quire an odd change in the orbital angular momentum quantum number 
(A l 7^ 0). 38 

The treatment above of the parity operator acting on a wavefunction 
is quite general and even in complex atoms the wavefunctions have a 
definite parity. The selection rules we have discussed in this section and 
others are tabulated in Appendix C. If the electric dipole matrix element 
is zero between two states then other types of transition may occur but 
at a rate many orders of magnitude slower than allowed transitions. 

The allowed transitions between the n = 1, 2 and 3 shells of atomic 
hydrogen are shown in Fig. 2.2, as an example of the selection rules. 
The 2s configuration has no allowed transitions downwards; this makes 
it metastable, i.e. it has a very long lifetime of about 0.125 s. 39 

Finally, a comment on the spectroscopic notation. It can be seen 
in Fig. 2.2 that the allowed transitions give rise to several series of 



36 This can be proved formally in 
quantum mechanics by showing that 
the Hamiltonians for these interactions 
commute with the parity operator. The 
weak interaction in nuclear physics does 
not have mirror symmetry and violates 
parity conservation. The extremely 
small effect of the weak interaction on 
atoms has been measured in exceed- 
ingly careful and precise experiments. 



37 See, for example, Mandl (1992). 



OO 

The radial integral is not changed by 
the parity transformation. 



OQ 

This special feature is used in the ex- 
periment described in Section 2.3.4. 
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Fig. 2.2 Allowed transitions between 
the configurations of hydrogen obey the 
selection rule Al = ±1. The configu- 
rations with l = 0, 1, 2, 3, 4, . . . are la- 
belled s, p, d, f, g, and so on alphabeti- 
cally (the usual convention). In the spe- 
cial case of hydrogen the energy does 
not depend on the quantum number l. 



40 For hydrogen this is the Lyman se- 
ries, as marked on Fig. 1.1; however, 
p-series is a general name. 

41 These names reflect the appearance 
of the lines in the first experimental ob- 
servations. 



42 

By considering elliptical orbits, 
rather than just circular ones, Som- 
merfeld refined Bohr’s theory to 
obtain a relativistic expression for the 
energy levels in hydrogen that gave 
very accurate predictions of the fine 
structure; however, details of that 
approach are not given here. 




lines. The series of lines to the ground configuration is called the p- 
series, where p stands for principal — this is the only series observed in 
absorption 40 — hence p labels configurations with 1 = 1. The s-series of 
lines goes from l = 0 configurations (to a level with / = 1), and similarly 
the d-series goes from l = 2 configurations; s and d stand for sharp and 
diffuse, respectively. 41 



2.3 Fine structure 

Relativistic effects lead to small splittings of the atomic energy levels 
called fine structure. We estimated the size of this structure in Section 

1.4 by comparing the speed of electrons in classical orbits with the speed 
of light. 42 In this section we look at how to calculate fine structure 
by treating relativistic effects as a perturbation to the solutions of the 
Schrodinger equation. This approach requires the concept that electrons 
have spin. 
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2.3.1 Spin of the electron 

In addition to the evidence provided by observations of the fine structure 
itself, that is described in this section, two other experiments showed 
that the electron has spin angular momentum, not just orbital angular 
momentum. One of these pieces of experimental evidence for spin was 
the observation of the so-called anomalous Zeeman effect. For many 
atoms, e.g. hydrogen and sodium, the splitting of their spectral lines 
in a magnetic field does not have the pattern predicted by the normal 
Zeeman effect (that we found classically in Section 1.8). This anomalous 
Zeeman effect has a straightforward explanation in terms of electron 
spin (as shown in Section 5.5). The second experiment was the famous 
Stern-Gerlach experiment that will be described in Section 6. 4.1. 43 

Unlike orbital angular momentum, spin does not have eigenstates that 
are functions of the angular coordinates. Spin is a more abstract con- 
cept and it is convenient to write its eigenstates in Dirac’s ket notation 
as | s m s ). The full wavefunction for a one-electron atom is the product of 
the radial, angular and spin wavefunctions: T = R n y(r) Yi :Tn ( 6 , </>) \sm s ). 
Or, using ket notation for all of the angular momentum, not just the spin, 

^ = Rn,i(r) I lmism s ). (2.43) 

These atomic wavefunctions provide a basis in which to calculate the 
effect of perturbations on the atom. However, some problems do not 
require the full machinery of (degenerate) perturbation theory and for 
the time being we shall treat the orbital and spin angular momenta by 
analogy with classical vectors. To a large extent this vector model is 
intuitively obvious and we start to use it without formal derivations. 
But note the following points. An often-used shorthand for the spin 
eigenfunctions is spin- up: 

|s= m s = \) = It), (2.44) 

and similarly ||) for the m s = — \ state (spin-down). However, in quan- 
tum mechanics the angular momentum cannot be completely aligned 
‘up’ or ‘down’ with respect to the z-axis, otherwise the x- and ^/-comp- 
onents would be zero and we would know all three components simul- 
taneously. 44 The vector model mimics this feature with classical vectors 
drawn with length |s| = yj s(s + 1) = a/ 3/2. (Only the expectation 
value of the square of the angular momentum has meaning in quan- 
tum mechanics.) The spin-up and spin-down states are as illustrated 
in Fig. 2.3 with components along the z- axis of ±-|. We can think of 
the vector as rotating around the z-axis, or just having an undefined 
direction in the xy- plane corresponding to a lack of knowledge of the x- 
and //-components (see also Grant and Phillips 2001). 

The name ‘spin’ invokes an analogy with a classical system spinning on 
its axis, e.g. a sphere rotating about an axis through its centre of mass, 
but this mental picture has to be treated with caution; spin cannot be 
equal to the sum of the orbital angular momenta of the constituents since 
that will always be an integer multiple of h. In any case, the electron is 



43 The fine structure, anomalous Zee- 
man effect and Stern-Gerlach exper- 
iment all involve the interaction of 
the electron’s magnetic moment with a 
magnetic field — the internal field of the 
atom in the case of fine structure. Stern 
and Gerlach detected the magnetic in- 
teraction by its influence on the atom’s 
motion, whereas the Zeeman effect and 
fine structure are observed by spectro- 
scopy. 



44 This is not possible since the oper- 
ators for the x - , y- and ^-components 
of angular momentum do not commute 
(save in a few special cases; we can 
know that s x = s y = s z = 0 if s = 0). 



36 The hydrogen atom 



Fig. 2.3 The representation of (a) spin- 
up and (b) spin-down states as vectors 
precessing around the z-axis. 





a structureless elementary particle with no measurable size. So we are 
left with the experimental fact that the electron has an intrinsic spin 
angular momentum of h / 2 and these half- integer values are perfectly 
acceptable within the general theory of angular momentum in quantum 
mechanics. 



2.3.2 The spin— orbit interaction 

The Schrodinger equation is non-relativistic, as can readily be seen by 
looking at the kinetic-energy operator that is equivalent to the non- 
relativistic expression p 2 /2m e . Some of the relativistic effects can be 
taken into account as follows. An electron moving through an electric 
field E experiences an effective magnetic field B given by 

B = — 4tv x E. (2.45) 

c 2 



45 The Biot-Savart law for the magnetic 
field from a current flowing along a 
straight wire can be recovered from the 
Lorentz transformation and Coulomb’s 
law (Griffiths 1999). However, this 
link can only be made in this direction 
for simple cases and generally the phe- 
nomenon of magnetism cannot be ‘de- 
rived’ in this way. 



This is a consequence of the way an electric field behaves under a Lorentz 
transformation from a stationary to a moving frame in special relativity. 
Although a derivation of this equation is not given here, it is certainly 
plausible since special relativity and electromagnetism are intimately 
linked through the speed of light c = 1 / yj eo/io- This equation for the 
speed of electromagnetic waves in a vacuum comes from Maxwell’s equa- 
tions; eo being associated with the electric field and go with the magnetic 
field. Rearrangement to give go = 1/ (eoc 2 ) suggests that magnetic 
fields arise from electrodynamics and relativity. 45 

We now manipulate eqn 2.45 into a convenient form, by substituting 
for the electric field in terms of the gradient of the potential energy V 
and unit vector in the radial direction: 



E = 



ldVr 

e dr r 



(2.46) 



The factor of e comes in because the electron’s potential energy V equals 
its charge — e times the electrostatic potential. From eqn 2.45 we have 



B = 



1 

m e c 2 



J _dV 

er dr 



h 

r x m e v = 

m e c z 



er or 



(2.47) 



where the orbital angular momentum is hi = r x m e v. The electron 
has an intrinsic magnetic moment pi = —g s g bs, where the spin has a 
magnitude of |s| = s = 1/2 (in units of h) and g s ~ 2, so the moment 
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has a magnitude close to one Bohr magneton (//b = eh/2m e ). The 
interaction of the electron’s magnetic moment with the orbital field gives 
the Hamiltonian 



H = — /jL • B 



= Qst^ bS • 



h 

m e c 2 



]_dV\ 

er dr J 



1 . 



(2.48) 



However, this expression gives energy splittings about twice as large 
as observed. The discrepancy comes from the Thomas precession — a 
relativistic effect that arises because we are calculating the magnetic 
field in a frame of reference that is not stationary but rotates as the 
electron moves about the nucleus. The effect is taken into account by 
replacing g s with g s — 1 ~ l. 46 Finally, we find the spin-orbit interaction, 
including the Thomas precession factor, is 47 

< 2 - 49 > 

For the Coulomb potential in hydrogen we have 



1 dV e 2 /47reo 

r dr r 3 



(2.50) 



The expectation value of this Hamiltonian gives an energy change of 48 



Er-C 



h 2 e 2 
2mlc 2 47reo 




(2.51) 



The separation into a product of radial and angular expectation values 
follows from the separability of the wavefunction. The integral (l/r 3 ) is 
given in eqn 2.23. However, we have not yet discussed how to deal with 
interactions that have the form of dot products of two angular momenta; 
let us start by defining the total angular momentum of the atom as the 
sum of its orbital and spin angular momenta, 



j = 1 + s. 



(2.52) 



This is a conserved quantity for a system without any external torque 
acting on it, e.g. an atom in a field- free region of space. This is true 
both in classical and quantum mechanics, but we concentrate on the 
classical explanation in this section. The spin-orbit interaction between 
1 and s causes these vectors to change direction, and because their sum 
is constrained to be equal to j they move around as shown in Fig. 2. 4. 49 
Squaring and rearranging eqn 2.52, we find that 2s • 1 = j 2 — l 2 — s 2 . 
Hence we can find the expectation value in terms of the known values 
for (j 2 ) , (l 2 ) and (s 2 ) as 

( s ' !) = 2 0’ + 1) “ + 1) - 5 (s + 1)} . (2.53) 

Thus the spin-orbit interaction produces a shift in energy of 

Es-O = ^ {j {j + 1) - l + 1) - 5 (s + 1)} , 



46 This is almost equivalent to using 
g s / 2 ~ 1, but g s — 1 is more accu- 
rate at the level of precision where the 
small deviation of g s from 2 is impor- 
tant (Haar and Curtis 1987). For fur- 
ther discussion of Thomas precession 
see Cowan (1981), Eisberg and Resnick 
(1985) and Munoz (2001). 

47 We have derived this classically, e.g. 
by using h\ = r x m e v. However, the 
same expression can be obtained from 
the fully relativistic Dirac equation for 
an electron in a Coulomb potential by 
making a low- velocity approximation, 
see Sakurai (1967). That quantum me- 
chanical approach justifies treating 1 
and s as operators. 

48 

Using the approximation g s — 1 ~ 1. 




1 



Fig. 2.4 The orbital and spin angular 
momenta add to give a total atomic an- 
gular momentum of j. 

49 In this precession about j the magni- 
tudes of 1 and s remain constant. The 
magnetic moment (proportional to s) 
is not altered in an interaction with 
a magnetic field, and because of the 
symmetrical form of the interaction in 
eqn 2.49, we do not expect 1 to be- 
have any differently. See also Blundell 
(2001) and Section 5.1. 



(2.54) 
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50 

As shown in Section 1.9, m e acao = 
h and hcRoo = (e 2 / 47reo) / (2ao) • 



51 Another short form found in the lit- 
erature is 2 2 P 1 /2 and 2 2 P 3 / 2 - 



where the spin-orbit constant /3 is (from eqns 2.51 and 2.23) 



p = 



h 2 



l 



2 m 2 c 2 47re 0 ( na 0 ) 3 l (l + \) (l + 1) 



(2.55) 



A single electron has s = \ so, for each l, its total angular momentum 
quantum number j has two possible values: 

j = l + \ or l- 1 -. 

From eqn 2.54 we find that the energy interval between these levels, 
AF’s-o = Ej =l+ i — Ej =l _ i , is 



AE's-o = /?(/ + !) = 



a 2 hcRoo 
n 3 «(/ + 1) ’ 



(2.56) 



Or, expressed in terms of the gross energy E(n ) in eqn 1.10, 50 

< 2 - 57 > 

This agrees with the qualitative discussion in Section 1.4, where we 
showed that relativistic effects cause energy changes of order a 2 times 
the gross structure. The more complete expression above shows that the 
energy intervals between levels decrease as n and l increase. The largest 
interval in hydrogen occurs for n = 2 and Z = 1; for this configuration 
the spin-orbit interaction leads to levels with j = 1/2 and j = 3/2. 
The full designation of these levels is 2p 2 P 1 /2 and 2p 2 P 3 / 2 , in the no- 
tation that will be introduced for the L6'-coupling scheme. But some of 
the quantum numbers (defined in Chapter 5) are superfluous for atoms 
with a single valence electron and a convenient short form is to denote 
these two levels by 2Fi/ 2 and 2 P 3 / 2 ; these correspond to nPj, where 
P represents the (total) orbital angular momentum for this case. (The 
capital letters are consistent with later usage.) Similarly, we may write 
2 Si/ 2 for the 2 s 2 Si/ 2 level; 3D 3 / 2 and 3D 5 / 2 for the j = 3/2 and 5/2 
levels, respectively, that arise from the 3d configuration. 51 But the full 
notation must be used whenever ambiguity might arise. 



2.3.3 The fine structure of hydrogen 

As an example of fine structure, we look in detail at the levels that arise 
from the n = 2 and n = 3 shells of hydrogen. Equation 2.54 predicts 
that, for the 2p configuration, the fine- structure levels have energies of 

E s -o (2 P 1/2) = — /^2 P , 

Es—o (2 P 3 /2) = 2 5 

as shown in Fig. 2.5(a). For the 3d configuration 

E s -o (3D 3 / 2 ) = - -/fed , 

E s - o (3D 5 / 2 ) sS= Psd 5 
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as shown in Fig. 2.5(b). For both configurations, it is easy to see that 
the spin-orbit interaction does not shift the mean energy 

E = (2j + 1) Ej (n, l) + (2 / + 1) E y (n, Z) , (2.58) 

where j' = l — 1/2 and j = l + 1/2 for the two levels. This calculation of 
the ‘centre of gravity’ for all the states takes into account the degeneracy 
of each level. 

The spin-orbit interaction does not affect the 2 Si/ 2 or 3 Si/ 2 so we 
might expect these levels to he close to the centre of gravity of the 
configurations with l > 0. This is not the case. Fig. 2.6 shows the 
energies of the levels for the n = 3 shell given by a fully relativistic 
calculation. We can see that there are other effects of similar magnitude 
to the spin-orbit interaction that affect these levels in hydrogen. Quite 
remarkably, these additional relativistic effects shift the levels by just the 
right amount to make nPp 2 levels degenerate with the n Si/ 2 levels, and 
nP 3 / 2 degenerate with nD 3 / 2 . This structure does not occur by chance, 
but points to a deeper underlying cause. The full explanation of these 
observations requires relativistic quantum mechanics and the technical 
details of such calculations he beyond the scope of this book. 52 We shall 
simply quote the solution of the Dirac equation for an electron in a 
Coulomb potential; this gives a formula for the energy Ebirac (iq j) that 
depends only on n and j, i.e. it gives the same energy for levels of the 
same n and j but different /, as in the cases above. In a comparison of the 
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Fig. 2.5 The fine structure of hydro- 
gen. The fine structure of (a) the 2p 
and (b) the 3d configurations are drawn 
on different scales: ^2p is consider- 

ably greater than d3d- All p- and d- 
configurations look similar apart from 
an overall scaling factor. 

52 

See graduate-level quantum mechan- 
ics texts, e.g. Sakurai (1967) and Series 
(1988). 
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Fig. 2.6 The theoretical positions of the energy levels of hydrogen calculated by the fully relativistic theory of Dirac depend 
on n and j only (not /), as shown in this figure for the n = 3 shell. In addition to the spin-orbit interaction, the effects that 
determine the energies of these levels are: the relativistic mass correction and, for s-electrons only, the Darwin term (that 
accounts for relativistic effects that occur at small r, where the electron’s momentum becomes comparable to m e c). 
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53 

The Dirac equation predicts that the 
electron has g s = 2 exactly. 



54 Lamb and Retherford used a radio- 
frequency to drive the 2 Si/ 2 - 2 Pi /2 
transition directly. This small en- 
ergy interval, now know as the Lamb 
shift, cannot be resolved in conven- 
tional spectroscopy because of Doppler 
broadening, but it can be seen us- 
ing Doppler-free methods as shown in 
Fig. 8.7. 

55 The QED calculation of the Lamb 
shift is described in Sakurai (1967). 

56 

Broadly speaking, in a mathemat- 
ical treatment these vacuum fluctua- 
tions correspond to the zero-point en- 
ergy of quantum harmonic oscillators, 
i.e. the lowest energy of the modes of 
the system is not zero but Jtlj/2. 

57 

QED also explains why the ^-factor 
of the electron is not exactly 2. Pre- 
cise measurements show that g s = 
2.002 319 304 3718 (current values for 
the fundamental constants can be 
found on the NIST web site, and those 
of other national standards laborato- 
ries). See also Chapter 12. 



exact relativistic solution of the Dirac equation and the non-relativistic 
energy levels, three relativistic effects can be distinguished. 

(a) There is a straightforward relativistic shift of the energy (or equiva- 
lently mass), related to the binomial expansion of 7 = ( 1 — u 2 /c 2 ) -1 / 2 , 
in eqn 1.16. The term of order v 2 / c 2 gives the non-relativistic ki- 
netic energy p 2 /2m e . The next term in the expansion is propor- 
tional to u 4 /c 4 and gives an energy shift of order v 2 /c 2 times the 
gross structure — this is the effect that we estimated in Section 1.4. 

(b) For electrons with l 7 ^ 0, the comparison of the Dirac and Schrodinger 
equations shows that there is a spin-orbit interaction of the form 
given above, with the Thomas precession factor naturally included . 53 

(c) For electrons with l = 0 there is a Darwin term proportional to 
\vp (r = 0 )| 2 that has no classical analogue (see Woodgate (1980) for 
further details). 

That these different contributions conspire together to perturb the 
wavefunctions such that levels of the same n and j are degenerate seems 
improbable from a non-relativistic point of view. It is worth reiterat- 
ing the statement above that this structure arises from the relativistic 
Dirac equation; making an approximation for small v 2 /c 2 shows that 
these three corrections, and no others, need to be applied to the (non- 
relativistic) energies found from the Schrodinger equation. 

2.3.4 The Lamb shift 

Figure 2.7 shows the actual energy levels of the n = 2 and n = 3 shells. 
According to relativistic quantum theory the 2 Si / 2 level should be ex- 
actly degenerate with 2 P\/ 2 because they both have n = 2 and j = 1/2, 
but in reality there is an energy interval between them, E (2S 1 / 2 ) — 
£’(2P 1 / 2 ) — 1GHz. The shift of the 2 Si / 2 level to a higher energy 
(lower binding energy) than the F^Dirac (n = 2, j = 1/2) is about one- 
tenth of the interval between the two fine-structure levels, E (2P 3 / 2 ) — 
E (2Piy 2 ) ~ 11 GHz. Although small, this discrepancy in hydrogen was 
of great historical importance in physics. For this simple one-electron 
atom the predictions of the Dirac equation are very precise and that 
theory cannot account for Lamb and Retherford’s experimental mea- 
surement that the 2 Si / 2 level is indeed higher than the 2P 1 ^ 2 level . 54 
The explanation of this Lamb shift goes beyond relativistic quantum me- 
chanics and requires quantum electrodynamics (QED) — the quantum 
field theory that describes electromagnetic interactions. Indeed, the ob- 
servation of the Lamb shift experiment was a stimulus for the develop- 
ment of this theory . 55 An intriguing feature of QED is so-called vacuum 
fluctuations — regions of free space are not regarded as being completely 
empty but are permeated by fluctuating electromagnetic fields . 56 The 
QED effects lead to a significant energy shift for electrons with l = 0 
and hence break the degeneracy of 2 Si / 2 and 2P 1 / 2 . 57 The largest QED 
shift occurs for the 1 Si / 2 ground level of hydrogen but there is no other 
level nearby and so a determination of its energy requires a precise mea- 
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(c) 




surement of a large frequency. Nowadays this can be achieved by laser 
spectroscopy (Chapter 8) but the near degeneracy of the two j = 1/2 
levels with n = 2 was crucial in Lamb’s experiment. 58 Another im- 
portant feature in that experiment was the metast ability of the 2 Si / 2 
level, whose lifetime was given in Section 2.2.3. That level decays ^ 10 8 
times more slowly than that of 2P!/ 2 . In an atomic beam of hydro- 
gen (at room temperature) the atoms have typical velocities of about 
3000 ms -1 and atoms excited into the 2p configuration travel an aver- 
age distance of only 5 x 10 -6 m before decaying with the emission of 
Lyman-a radiation. In contrast, metastable atoms travel the full length 
of the apparatus (~ 1 m) and are de-excited when they collide with a 
detector (or the wall of the vacuum chamber). Hydrogen, and hydro- 
genic systems, are still used for experimental tests of fundamental theory 
because their simplicity allows very precise predictions. 



2.3.5 Transitions between fine-structure levels 

Transitions in hydrogen between the fine-structure levels with principal 
quantum numbers n = 2 and 3 give the components of the Balmer-a 
line shown in Fig. 2.7; in order of increasing energy, the seven allowed 



Fig. 2.7 The fine structure of the 
n = 2 and n = 3 shells of hydro- 
gen and the allowed transitions between 
the levels. According to the Dirac 
equation, the 2S 1/ / 2 and 2P x / 2 levels 
should be degenerate, but they are not. 
The measured positions show that the 
2s 2 S 1 /2 level is shifted upwards relative 
to the position FTirac ( n — 2, j = 1/2) 
and is therefore not degenerate with 
the 2p 2 P 1 / 2 level. Such a shift occurs 
for all the s-electrons (but the size of 
the energy shift decreases with increas- 
ing n). The explanation of this shift 
takes us beyond relativistic quantum 
mechanics into the realm of quantum 
electrodynamics (QED) — the quantum 
field theory that describes electromag- 
netic interactions. 



Fig. 2.8 The conservation of total an- 
gular momentum in electric dipole tran- 
sitions that gives the selection rule in 
eqn 2.59 can be represented as vector 
addition. The photon has one unit of 
angular momentum, and so to go from 
level ji to j 2 the vectors must form a 
triangle, as shown for the case of (a) 
jl = 1/2 to j 2 = 1/2, (b) ji = 1/2 to 
J 2 = 3/2 and (c) j% = 3/2 to j 2 = 3/2. 



58 

Higher shells have smaller shifts be- 
tween the j = 1/2 levels. 
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transitions between the levels with different j are as follows: 




These obey the selection rule Al = ±1 but an additional rule prevents a 
transition between 2P 1 / 2 and 3 D 5 / 2 , namely that the change of the total 
angular momentum quantum number in an electric dipole transition 
obeys 



This selection rule may be explained by angular momentum conserva- 
tion (as mentioned in Section 2.2.2). This rule can be expressed in terms 
of vector addition, as shown in Fig. 2.8; the conservation condition is 
equivalent to being able to form a triangle from the three vectors rep- 
resenting j of the initial state, the final state, and a unit vector for the 
(one unit of) angular momentum carried by the photon. Hence, this se- 
lection rule is sometimes referred to as the triangle rule. The projection 
of j along the z-axis can change by A nrij = 0, d=l. (Appendix C gives a 
summary of all selection rules.) 

Further reading 

Much of the material covered in this chapter can be found in the intro- 
ductory quantum mechanics and atomic physics texts listed in the Ref- 
erences. For particular topics the following are useful: Segre (1980) gives 
an overview of the historical development, and Series (1988) reviews the 
work on hydrogen, including the important Lamb shift experiment. 



Aj = 0,±l. 



(2.59) 



Exercises 



(2.1) Angular-momentum eigenfunctions 



(2.2) Angular-momentum eigenfunctions 



(a) Verify that all the eigenfunctions with l — 1 
are orthogonal to Yo,o- 



(a) Find the eigenfunction with orbital angular 
momentum quantum number l and magnetic 
quantum number m m l — 1. 



(b) Verify that all the eigenfunctions with l = 1 
are orthogonal to those with 1 = 2. 



(b) Verify that Yz,z-i is orthogonal to YJ_ i,z-i. 
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(2.3) Radial wavefunctions 

Verify eqn 2.23 for n — 2, l = 1 by calculating the 
radial integral (for Z — 1 ). 

(2.4) Hydrogen 

For a hydrogen atom the normalised wavefunction 
of an electron in the Is state, assuming a point 
nucleus, is 



V’M = 




where ao is the Bohr radius. Find an approximate 
expression for the probability of finding the elec- 
tron in a small sphere of radius <C ao centred on 
the proton. What is the electronic charge density 
in this region? 

(2.5) Hydrogen 

The Balmer-a spectral line is observed from a 
(weak) discharge in a lamp that contains a mixture 
of hydrogen and deuterium at room temperature. 
Comment on the feasibility of carrying out an ex- 
periment using a Fabry-Perot etalon to resolve (a) 
the isotope shift, (b) the fine structure and (c) the 
Lamb shift. 

(2.6) Transitions 

Estimate the lifetime of the excited state in a two- 
level atom when the transition wavelength is (a) 
100 nm and (b) 1000 nm. In what spectral regions 
do these wavelengths lie? 

(2.7) Selection rules 

By explicit calculation of integrals over 0, for the 
case of 7r-polarization only, verify that p to d tran- 
sitions are allowed, but not s to d. 

(2.8) Spin-orbit interaction 

The spin-orbit interaction splits a single-electron 
configuration into two levels with total angular 
momentum quantum numbers j — Z + 1/2 and 
j' — l — 1/2. Show that this interaction does not 
shift the mean energy (centre of gravity) of all the 
states given by (2 j + 1) Ej + (2 j' + 1) Ep . 

(2.9) Selection rule for the magnetic quantum number 
Show that the angular integrals for cr-transitions 
contain the factor 



e i(m il -m i2 ±l)0 d ^_ 

Hence derive the selection rule A mi p ±1 for this 
polarization. Similarly, derive the selection rule 
for the 7r- transitions. 

(2.10) Transitions 

An atom in a superposition of two states has the 



wavefunction 

(t) = Aip i (r) e ~ iElt/h + Bip 2 (r) e ~ iE2t/h . 
The distribution of electronic charge is given by 
-e |T (t ) | 2 = — e{ |A^i| 2 + \B^> 2 \ 2 

+ 1 2A* Et'ifl 1 cos (wi 2 t — (f)}- 

Part of this oscillates at the (angular) frequency 
of the transition ui 2 = 0 J 2 — wi — (E 2 — E\) /h. 

(a) A hydrogen atom is in a superposition of the 
Is ground state, ifi = i?i,o (r) Yo,o (0, <f > ), and 
the mi = 0 state of the 2p configuration, ip 2 = 
R 2 ,i (r)Vi, 0 (M); A ~ 0.995 and B = 0.1 
(so the term containing B 2 can be ignored). 
Sketch the form of the charge distribution for 
one cycle of oscillation. 

(b) The atom in a superposition state may have 
an oscillating electric dipole moment 

-eD (t) = -e(V*(t) rT (t)) . 

What are the conditions on and ^2 for 
which D (t) p 0. 

(c) Show that an atom in a superposition of the 
same states as in part (a) has a dipole moment 
of 

— eD (t) = —e \2A*B\ X ang 

x jy rR 2 ,i(r)Ri : o(r)r 2 dr| cos(cui 2 t)e z , 

where X ang is an integral with respect to 0 and 
<f. Calculate the amplitude of this dipole, in 
units of eao, for A — B — l/y/2. 

(d) A hydrogen atom is in a superposition of the 
Is ground state and the mi — 1 state of the 2p 
configuration, i/; 2 m R 21 (r) Yyi (6, (f). Sketch 
the form of the charge distribution at various 
points in its cycle of oscillation. 

(e) Comment on the relationship between the 
time dependence of the charge distributions 
sketched in this exercise and the motion of the 
electron in the classical model of the Zeeman 
effect (Section 1.8). 

(2.11) Angular eigenfunctions 

We shall find the angular momentum eigenfunc- 
tions using ladder operators, by assuming that for 
some value of l there is a maximum value of the 
magnetic quantum number ra max . For this case 
V ,m max 0(#)e imrnax ^ and the function 0(0) can 
be found from 

4©(0) ex P (i^ma X0) = 0 . 
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(a) Show that 0(0) satisfies the equation 

1 <90(0) _ _ cos 6 

e(e)~do~ =mmax il^0 • 

(b) Find the solution of the equation for 0(0). 
(Both sides have the form /'(0)//(0) whose 
integral is ln{/(0)}.) By substituting 
this solution into eqn 2.5 to show that 
b = ra max (ra max + 1), or otherwise, obtain 
eqn 2.10. 

(2.12) Parity and selection rules 

Show that eqn 2.42 implies that I 2 — li is odd. 



Hence, or otherwise, prove that X ang is zero unless 
the initial and final states have opposite parity. 

(2.13) Selection rules in hydrogen 

Hydrogen atoms are excited (by a pulse of laser 
light that drives a multi-photon process) to a spe- 
cific configuration and the subsequent spontaneous 
emission is resolved using a spectrograph. Infra- 
red and visible spectral lines are detected only at 
the wavelengths 4.05 /im, 1.87 /am and 0.656 /im. 
Explain these observations and give the values of 
n and l for the configurations involved in these 
transitions. 



Web site: 

http : //www. physics . ox. ac.uk/users/foot 

This site has answers to some of the exercises, corrections and other supplementary information. 
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Helium has only two electrons but this simplicity is deceptive. To treat 
systems with two particles requires new concepts that also apply to 
multi-particle systems in many branches of physics, and it is very worth- 
while to study them carefully using the example of helium. There is 
truth in the saying that atomic physicists count ‘one, two, many’ and a 
detailed understanding of the two-electron system is sufficient for much 
of the atomic structure in this book. 1 
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3.1 The ground state of helium 

Two electrons in the Coulomb potential of a charge Ze 
of an atom, obey a Schrodinger equation of the form 



■ffihis book considers only those multi- 
electron systems with one, or two, va- 
lence electrons ‘outside’ a spherically- 

. , , symmetric core of charge, 

e.g. the nucleus 



{ 




-h 2 ~ e 2 ( Z Z 

Vo H 

2 171 47T6o V 7*1 V2 




Eif). 



(3.1) 



Here ryi = |r*i — r 2 1 is the distance between electron 1 and electron 2 and 
the electrostatic repulsion of electrons is proportional to I/V 12 . Neglect- 
ing this mutual repulsion for the time being, we can write the equation 
as 

{H 1 +H 2 )^ = E ( - 0 ^, (3.2) 



where 



H 1 = 




Ze 2 

47reori 



(3.3) 



and H 2 is a similar expression for electron 2. Writing the atomic 
wavefunction as a product of the wavefunctions for each electron, ij; = 
^(1)^(2), allows us to separate eqn 3.2 into two single-electron 
Schrodinger equations: 



Hu/} (1) = Erf (1) (3.4) 

and a similar equation for i/; (2) with energy E 2 . The solutions of these 
one-electron equations are hydrogenic wavefunctions with energies given 
by the Rydberg formula. Helium has Z = 2 and in its ground state both 
electrons have energy E\ = £2 = —^hcRoo = — 54.4eV. Thus the total 
energy of the atom (neglecting repulsion) is 



£(°) =El +E 2 = -109 eV. 



(3.5) 
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2 l/\/47r is the angular part of an s- 
electron wavefunction. 



3 

This is a standard quantum mechan- 
ical technique whose mathematical de- 
tails are given in quantum texts. The 
essential principle of this technique is 
to find an expression for the energy 
in terms of a parameter — an effective 
atomic number in the case of helium — 
and then minimise the energy with re- 
spect to this parameter, i.e. study the 
variation in the energy as a function of 
the chosen parameter. 

4 This is accessible at http://www. 
oup.co.uk/best.textbooks/physics/ 
ephys/illustrations/ along with 
other illustrations of elementary 
quantum ideas. 

5 It is often said that ‘one electron is 
in a spin-up state and the other is spin- 
down’; what this really means is defined 
in the discussion of spin for the excited 
states of helium. 



Now we need to calculate the perturbation produced by the electron- 
electron repulsion. The system has the spatial wavefunction 

•‘/’is 2 = R z u 2 (n) Rfr 2 O2) X ; (3.6) 

where radial wavefunctions are defined in Table 2. 2. 2 The expectation 
value of the repulsion is (see Section 3.3) 

2 roo noo 

~ A / / <1 — V’is 2 r i dri dr2 = 34eV . (3.7) 

47re 0 J 0 Jo r 12 

Adding this to the (zeroth-order) estimate A* 0 - 1 gives an energy of £J(ls 2 ) 
= —109 + 34 = — 75 eV. It takes an energy of 75 eV to remove both 
electrons from a helium atom leaving a bare helium nucleus He ++ — 
the second ionization energy. To go from He + to He ++ takes 54.4 eV, 
so this estimate suggests that the first ionization energy (required to 
remove one electron from He to create He + ) is IE (He) 75 — 54 
21 eV. But the expectation value in eqn 3.7 is not small compared to 
the binding energy and therefore the perturbation has a significant effect 
on the wavefunctions. The necessary adjustment of the wavefunctions 
can be accounted for by the variational method. 3 This method gives a 
value close to the measured ionization energy 24.6 eV. Helium has the 
highest first ionization energy of all elements because of its closed n = 1 
shell. For a plot of the ionization energies of the elements see Grant and 
Phillips (2001, Chapter 11, Fig. 18). 4 

According to the Pauli exclusion principle, two electrons cannot have 
the same set of quantum numbers. Therefore there must be some ad- 
ditional quantum number associated with the two ls-electrons in the 
ground state of helium — this is their spin (introduced in Section 2.3.1). 
The observed filling-up of the atomic (sub-) shells in the periodic table 
implies that two spin states are associated with each set of spatial quan- 
tum numbers n, /, m/. 5 However, electrostatic energies do not depend 
on spin and we can find the spatial wavefunctions separately from the 
problem of finding the spin eigenfunctions. 
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6 The spatial wavefunction u contains 
both radial and angular parts but the 
energy does not depend on the mag- 
netic quantum number, so we drop m as 
a subscript on u. The repulsion from a 
spherically-symmetric Is wavefunction 
does not depend on the orientation of 
the other electron. To show this mathe- 
matically we could carry m through all 
the calculations and examine the result- 
ing angular integrals, but this is cum- 
bersome. 



To find the energy of the excited states we use the same procedure as 
for the ground state — at first we neglect the mutual repulsion term and 
separate eqn 3.1 into two one-electron equations that have solutions 6 

^ls(l) = Rls(j“l) X 7= 5 
V47T 

Unl( 2 ) = Rnl(r2)Yl,m (# 2 , 02 ) 

for the configuration Is nl. The spatial part of the atomic wavefunction 
is the product 

V^space = '^ls(l)'^nZ (2) • (3.8) 
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Another wavefunction has the same energy, namely 



V^space — ^ls (1) . 



(3.9) 



These two states are related by a permutation of the labels on the elec- 
trons, 1 2; the energy cannot depend on the labeling of identical 

particles so there is exchange degeneracy. To consider the effect of the 
repulsive term on this pair of wavefunctions with the same energy (de- 
generate states) we need degenerate perturbation theory. There are two 
approaches. The look-before-you-leap approach is first to form eigen- 
states of the perturbation from linear combinations of the initial states. 7 
In this new basis the determination of the eigenenergies of the states is 
simple. It is instructive, however, simply to press ahead and go through 
the algebra once. 8 

We rewrite the Schrodinger equation (eqn 3.1) as 

(H 0 + H')4> = Eil), (3.10) 



7 

This is guided by looking for eigen- 
states of symmetry operators that com- 
mute with the Hamiltonian for the in- 
teraction, as in Section 4.5. 

o 

In the light of this experience one can 
take the short cut in future. 



where Hq = Hi + # 2 , and we consider the mutual repulsion of the 
electrons H' = e 2 /47reo?T2 as a perturbation. We also rewrite eqn 3.2 as 

H 0 ip = E^ip, (3.11) 



where E i<<] = E\ + E 2 is the unperturbed energy. Subtraction of eqn 
3.11 from eqn 3.10 gives the energy change produced by the perturbation, 
A E = E-E ( -°\ as 

H'ip = AEip. (3.12) 

A general expression for the wavefunction with energy E^ is a linear 
combination of expressions 3.8 and 3.9, with arbitrary constants a and 

b, 

ip = aui s (l)u n/ (2) + bui s (2)u n i(l) . (3.13) 



Substitution into eqn 3.12, multiplication by either ^* s (l)iz* z (2) or 
iz* s (2)ix* z (l), and then integration over the spatial coordinates for each 
electron (tt,0i,0i and 7*2, #2? <^ 2 ) gives two coupled equations that we 
write as , ... 



J K 



K Ml 



A E 



(3.14) 



This is eqn 3.12 in matrix form. The direct integral is 



J = i // 1 ““ (1)|2 ^ dr i clr “ 

= f [ m,(n)Pn,(r 2) dr . dr », (3.15) 

47 T 60 J J r 12 



where pi s (l) = — e|^i s (l)| 2 is the charge density distribution for elec- 
tron 1, and similarly for p n i( 2). This direct integral represents the 
Coulomb repulsion of these charge clouds (Fig. 3.1). The exchange inte- 
gral is 



K = 



47T6 0 



//■ 



Ts 



(!)«■ 



ni( 2 ) — «is(2)u„/(l) dri drj . 

1 T2 



(3.16) 
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Fig. 3.1 The direct integral in a lsns 
configuration of helium corresponds to 
the Coulomb repulsion between two 
spherically-symmetric charge clouds 
made up of shells of charge like those 
shown. 



1 

v 2 K 

A tL 

J 

| Is nl 
Unperturbed 
configuration 

Fig. 3.2 The effect of the direct and 
exchange integrals on a lsrzi config- 
uration in helium. The singlet and 
triplet terms have an energy separation 
of twice the exchange integral (2K). 



9 It is easy to check which wavefunc- 
tion corresponds to which eigenvalue by 
substitution into the original equation. 



19 Another example is the classical 
treatment of the normal Zeeman effect. 




Unlike the direct integral, this does not have a simple classical interpre- 
tation in terms of charge (or probability) distributions — the exchange 
integral depends on interference of the amplitudes. The spherical sym- 
metry of the Is wavefunction makes the integrals straightforward to 
evaluate (Exercises 3.6 and 3.7). 

The eigenvalues A E in eqn 3.14 are found from 



J-AE 

K 



K 

J-AE 



= 0. 



(3.17) 



The roots of this deter minantal equation are A E = J =b K. The direct 
integral shifts both levels together but the exchange integral leads to an 
energy splitting of 2 K (see Fig. 3.2). Substitution back into eqn 3.14 
gives the two eigenvectors in which b = a and b = —a. These correspond 
to symmetric (S) and antisymmetric (A) wavef unctions: 






s 

space 

A 

space 



— { ^ls (1 )^nl (2) T ^ls (2)^77,/ (1) } 5 

= { ^ls (l)^nZ (2) ^ls (2)^77,/ ( 1 ) } • 



The wavefunction V^ace has an eigenenergy of E^ + J — K, and this 
is lower than the energy E^ + J + K for V’fpace- (For the Is nl con- 
figurations in helium K is positive.) 9 This is often interpreted as the 
electrons ‘avoiding’ each other, i.e. V^ace = 0 for n = 7*2, and for this 
wavefunction the probability of finding electron 1 close to electron 2 is 
small (see Exercise 3.3). This anticorrelation of the two electrons makes 
the expectation of the Coulomb repulsion between the electrons smaller 
than for ^ S s pace . 

The occurrence of symmetric and antisymmetric wavefunctions has a 
classical analogue illustrated in Fig. 3.3. A system of two oscillators, 
with the same resonance frequency, that interact (e.g. they are joined 
together by a spring) has antisymmetric and symmetric normal modes 
as illustrated in Fig. 3.3(b) and (c). These modes and their frequencies 
are found in Appendix A as an example of the application of degenerate 
perturbation theory in Newtonian mechanics . 10 

The exchange integral decreases as n and l increase because of the 
reduced overlap between the wavefunctions of the excited electron and 
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(a) 





Fig. 3.3 An illustration of degenerate 
perturbation in a classical system, (a) 
Two harmonic oscillators with the same 
oscillation frequency uio — each spring 
has a mass on one end and its other end 
is attached to a rigid support. An in- 
teraction, represented here by another 
spring that connects the masses, cou- 
ples the motions of the two masses. The 
normal modes of the system are (b) 
an in-phase oscillation at ujq, in which 
the spring between the masses does not 
change length, and (c) an out-of-phase 
oscillation at a higher frequency. Ap- 
pendix A gives the equations for this 
system of two masses and three springs, 
and also for the equivalent system of 
three masses joined by two springs that 
models a triatomic molecule, e.g. car- 
bon dioxide. 



the ls-electron. These trends are an obvious consequence of the form of 
the wavefunctions: the excited electron’s average orbit radius increases 
with energy and hence with n; the variation with l arises because the 
effective potential from the angular momentum (‘centrifugal’ barrier) 
leads to the wavefunction of the excited electron being small at small 
r. However, in the treatment as described above, the direct integral 
does not tend to zero as n and l increase, as shown by the following 
physical argument. The excited electron ‘sees’ the nuclear charge of +2e 
surrounded by the Is electronic charge distribution, i.e. in the region far 
from the nucleus where n/-electron’s wavefunction has a significant value 
it experiences a Coulomb potential of charge +le. Thus the excited 
electron has an energy similar to that of an electron in the hydrogen 
atom, as shown in Fig. 3.4. But we have started with the assumption 
that both the Is- and n/-electrons have an energy given by the Rydberg 
formula for Z = 2. The direct integral J equals the difference between 
these energies. 11 This work was an early triumph for wave mechanics 
since previously it had not been possible to calculate the structure of 
helium. 12 

In this section we found the wavefunctions and energy levels in helium 
by direct calculation but looking back we can see how to anticipate the 
answer by making use of symmetry arguments. The Hamiltonian for the 
electrostatic repulsion, proportional to 1 / 7*12 = 1/|it — r 2 1 , commutes 
with the operator that interchanges the particle labels 1 and 2, i.e. the 
swap operation 1 <-+ 2. (Although we shall not give this operator a 
symbol it is obvious that it leaves the value of l/ri 2 unchanged.) Com- 
muting operators have simultaneous eigenfunctions. This prompts us 



11 This can also be seen from eqn 3.15. 
The integration over r i, 6\ and 0i 
leads to a repulsive Coulomb potential 
~ e/ 47reor2 that cancels part of the at- 
tractive potential of the nucleus, when 
r -2 is greater than the values of r\ where 
*01 is appreciable. 

12 

For hydrogen, the solution of 
Schrodinger’s equation reproduced 
the energy levels calculated by the 
Bohr-Sommerfeld theory. However, 
wave mechanics does give more in- 
formation about hydrogen than the 
old quantum theory, e.g. it allows the 
detailed calculation of transition rates. 
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Helium H 



lsns lsnp lsnd lsnf n 




Fig. 3.4 The energy levels of the helium atom with those of hydrogen for comparison. The Is 2 ground configuration is tightly 
bound. For the excited configurations of helium the ls-electron screens the outer electron from the nuclear charge so that the 
Is nl configurations in helium have similar energy to the shell with principal quantum number n in hydrogen. The hydrogenic 
levels are indicated on the right. The interval between the 1 L and 3 L terms (equal to twice the exchange integral) is clear for 
the ls2s, ls2p, ls3s, ls3p and ls4s configurations but it is smaller for higher n and l. 



3.2 Excited states of helium 51 



to construct the symmetrised wavefunctions V^ace and V’fpace- 13 I n this 
basis of eigenstates it is simple to calculate the effect of the electrostatic 
repulsion. 

3.2.1 Spin eigenstates 

The electrostatic repulsion between the two electrons leads to the wave- 
functions V’fpace an d V^space i n the excited states of the helium atom. The 
ground state is a special case where both electrons have the same spatial 
wavefunction, so only a symmetric solution exists. We did not consider 
spin since electrostatic interactions depend on the charge of the particle, 
not their spin. Neither Ho nor H' contains any reference to the spin 
of the electrons. Spin does, however, have a profound effect on atomic 
wavefunctions. This arises from the deep connection between spin and 
the symmetry of the wavefunction of indistinguishable particles. 14 Note 
that here we are considering the total wavefunction in the systems that 
includes both the spatial part (found in the previous section) and the 
spin. Fermions have wavefunctions that are antisymmetric with respect 
to particle-label interchange, and bosons have symmetric ones. As a 
consequence of this symmetry property, fermions and bosons fill up the 
levels of a system in different ways, i.e. they obey different quantum 
statistics. 

Electrons are fermions so atoms have total wavefunctions that are 
antisymmetric with respect to permutation of the electron labels. This 
requires V’fpace to associate with an antisymmetric spin function i/A in , 
and the other way round: 

V> = Vfpace^U ° r ^aceVfpin • (3-18) 

These antisymmetrised wavefunctions that we have constructed fulfil the 
requirement of having particular symmetry with respect to the inter- 
change of indistinguishable particles. Now we shall find the spin eigen- 
functions explicitly. We use the shorthand notation where j and j repre- 
sent m s = 1/2 and —1/2, respectively. Two electrons have four possible 
combinations: the three symmetric functions, 

<i„ = in> 

= /|{iu) + im} (3.i9) 

» iu). 

corresponding to S' = 1 and Ms = +1,0, —1; and an antisymmetric 
function 

+in = ;^{|TI) - lit)}, (3.20) 

corresponding to S = 0 (with Ms = 0). 15 Spectroscopists label the eigen- 
states of the electrostatic interactions with the symbol 25+1 L, where S 
is the total spin and L is the total orbital angular momentum quantum 
number. The Is nl configurations in helium L = /, so the allowed terms 



13 

For two electrons, swapping the 
particle labels twice brings us back 
to where we started, so -0(1,2) = 

±0 (2, 1). Therefore the two possible 
eigenvalues are 1 for 'bfpace and —1 for 

t space • 



14 Indistinguishable means that the 
particles are identical and have the free- 
dom to exchange positions, e.g. atoms 
in a gas which obey Fermi-Dirac or 
Bose-Einstein statistics depending on 
their spin. In contrast, atoms in a 
solid can be treated as distinguishable, 
even if they are identical, because they 
have fixed positions — we could label the 
atoms 1, 2, etc. and still know which is 
which at some later time. 



15 

These statements about the result 
of adding two s = 1/2 angular mo- 
menta can be proved by formal angu- 
lar momentum theory. Simplified treat- 
ments describe S = 0 as having one 
electron with ‘spin-up’ and the other 
with ‘spin-down’; but both Mg = 0 
states are linear combinations of the 
states \m s i = +1/2, m s 2 = —1/2) and 
\m s i = -1/2, m s 2 = +1/2). 
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16 The letter ‘S’ appears over-used in 
this established notation but no ambi- 
guity arises in practice. The symbol S 
for the total spin is italic because this 
is a variable, whereas the symbols S for 
L = 0 and s for l = 0 are not italic. 



IT 

This anticipates a more general dis- 
cussion of this and other selection rules 
for the LS'-coupling scheme in a later 
chapter. 



are l L and 3 L, e.g. the ls2s configuration in helium gives rise to the 
terms X S and 3 S, where S represents L = 0. 16 

In summary, we have calculated the structure of helium in two distinct 
stages. 

(1) Energies Degenerate perturbation theory gives the space wave- 
functions ^gpace and t^space with energies split by twice the exchange 
integral. In helium the degeneracy arises because the two electrons 
are identical particles so there is exchange degeneracy, but the treat- 
ment is similar for systems where a degeneracy arises by accident. 

(2) Spin We determined the spin associated with each energy level by 
constructing symmetrised wavefunctions. The product of the spa- 
tial functions and the spin eigenstates gives the total atomic wave- 
function that must be antisymmetric with respect to particle-label 
interchange. 

Exchange degeneracy, exchange integrals, degenerate perturbation the- 
ory and symmetrised wavefunctions all occur in helium and their inter- 
relationship is not straightforward so that misconceptions abound. A 
common misinterpretation is to infer that because levels with different 
total spin, S = 0 and 1, have different energies then there is a spin- 
dependent interaction — this is not correct, but sometimes in condensed 
matter physics it is useful to pretend that it is! (See Blundell 2001.) 
The interactions that determine the gross structure of helium are en- 
tirely electrostatic and depend only on the charge and position of the 
particles. Also, degenerate perturbation theory is sometimes regarded 
as a mysterious quantum phenomenon. Appendix A gives further dis- 
cussion and shows that symmetric and antisymmetric normal modes 
occur when two classical systems, with similar energy, interact, e.g. two 
coupled oscillators. 

3.2.2 Transitions in helium 

To determine which transitions are allowed between the energy levels 
of helium we need a selection rule for spin: the total spin quantum 
number does not change in electric dipole transitions. In the matrix 
element {^fmaiM ^initial) the operator r does not act on spin; therefore, 
if the V’finai and ^initial do not have the same value of S', then their spin 
functions are orthogonal and the matrix element equals zero. 17 This 
selection rule gives the transitions shown in Fig. 3.5. 
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Fig. 3.5 The allowed transitions be- 
tween the terms of helium are governed 
by the selection rule AS = 0 in addi- 
tion to the rule Al = ±1 found pre- 
viously. Since there are no transitions 
between singlets and triplets it is con- 
venient to draw them as two separate 
systems. Notice that in the radiative 
decay of helium atoms excited to high- 
lying levels there are bottlenecks in the 
metastable ls2s 1 S and ls2s 3 S terms. 



3.3 Evaluation of the integrals in helium 

In this section we shall calculate the direct and exchange integrals to 
make quantitative predictions for some of the energy levels in the helium 
atom, based on the theory described in the previous sections. This 
provides an example of the use of atomic wavefunctions to carry out a 
calculation where there are no corresponding classical orbits and gives 
an indication of the complexities that arise in systems with more than 
one electron. The evaluation of the integrals requires care and some 
further details are given in Appendix B. The important point to be 
learnt from this section, however, is not the mathematical techniques 
but rather to see that the integrals arise from the Coulomb interaction 
between electrons treated by straightforward quantum mechanics. 



3.3.1 Ground state 

To calculate the energy of the Is 2 configuration we need to find the ex- 
pectation value of e 2 /47reo7T2 in eqn 3.1 — this calculation is the same 
as the evaluation of the mutual repulsion between two charge distribu- 
tions in classical electrostatics, as in eqn 3.15 with pi s (r\) and p n i(r 2 ) = 
pis (r 2 ). The integral can be considered in different ways. We could 
calculate the energy of the charge distribution of electron 1 in the po- 
tential created by electron 2, or the other way around. This section does 
neither; it uses a method that treats each electron symmetrically (as in 
Appendix B), but of course each approach gives the same numerical re- 
sult. Electron 1 produces an electrostatic potential at radial distance 7*2 
given by 

Vnir2)= L'^r/ {n)i311 - (3 - 21) 
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The spherical symmetry of s-electrons means that the charge in the 
region r± < r 2 acts like a point charge at the origin, so that 



V12 (r 2 ) = 



Q(r 2 ) 

47re 0 r 2 ’ 



18 Here Q (oo) = — e. 



where Q (r 2 ) is the charge within a radius of r 2 from the origin, which 
is given by 18 

rr 2 

Q{r2)= / p (n) 4nr1 dri . (3.22) 

Jo 

The electrostatic energy that arises from the repulsion equals 



Ei 2 




V 12 (r 2 ) p (r 2 ) 47rr| dr 2 . 



(3.23) 



19 As is usual in calculations of the in- 
teraction between electric charge dis- 
tributions, one must be careful to 
avoid double counting. This method 
of calculation avoids this pitfall, as 
shown by the general argument in Ap- 
pendix B. An alternative method is 
used in Woodgate (1980), Problem 5.5. 



For the Is 2 configuration there is an exactly equal contribution to the 
energy from V 21 (rq), the (partial) potential at rq produced by electron 2. 
Thus the total energy of the repulsion between the electrons is twice that 
in eqn 3.23. 19 Using the radial wavefunction for a ls-electron, we find 



J ls 2 = 2 x 



47re 0 



lo {Jo 2 E z 3 e ~ {z/ao) 2 rir i dr ^ 4z3e ~ iz/ao)2r2r % dr2 

(3.24) 



—E——\z = (13.6 eV) x ^-Z 
2a 0 4 v 4 



For helium this gives J^ 2 2 = 34 eV. 



3.3.2 Excited states: the direct integral 



20 

The effect of the repulsion propor- 
tional to l/ri 2 can be considered in 
terms of potentials like that in eqn 3.21 
(and Appendix B). The potential at 
the position of the outer electron 7*2 
arising from the charge distribution of 
electron 1 accounts for a large por- 
tion of the total repulsion: V 12 ( 7 * 2 ) — 
e 2 /47reo r 2 in the region where p n i ( 7 * 2 ) 
has an appreciable value. Hence it 
makes sense to include e 2 /47reoT2 in 
the zeroth-order Hamiltonian Ho a and 
treat the (small) part left over as a per- 
turbation H' a . 



A Is nl configuration of helium has an energy close to that of an nl- 
electron in hydrogen, e.g. in the ls2p configuration the 2p-electron has 
a similar binding energy to the n — 2 shell of hydrogen. The obvious 
explanation, in Bohr’s model, is that the 2p-electron lies outside the ls- 
orbit so that the inner electron screens the outer one from the full nuclear 
charge. Applying an analogous argument to the quantum treatment of 
helium leads to the Hamiltonian H = Ho a + H ' a , where 20 



H 0a = 



2 m 



(v? + v 



A( 1+1 ) 

47re 0 \ r i r 2/ 



(3.25) 



and 



rz _ e 2 f 1 

a 47T6 0 V r 12 




(3.26) 



In the expression for Ho a , electron 2 experiences the Coulomb attraction 
of a charge +le. In H' a the subtraction of e 2 /47reor 2 from the mutual 
repulsion means that the perturbation tends to zero at a large distance 
from the nucleus (which is intuitively reasonable). This decomposition 
differs from that in Section 3.1. The different treatment of the two 
electrons makes the perturbation theory a little tricky, but Heisenberg 
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did the calculation as described in Bethe and Salpeter (1957) or Bethe 
and Salpeter (1977); he found the direct integral 

Jisni = -f— f [ (— -) |«is(l)| 2 \u n im(2)\ 2 d 3 ri d 3 r 2 . (3.27) 

47r e 0 J J \ r i 2 r 2 / 



This must be evaluated with the appropriate wavefunctions, i.e. 
rather than and uf<r 2 as before. 21 For the excited electron = 

R n i(r)Yi m (0 , 0), where R n i( r) is the radial function for Z = 1. We write 
the direct integral as 



21 

We have not derived this integral rig- 
orously but it has an intuitively reason- 
able form. 



e 2 poo poo 

J\ S ni = ~ A / / J(ri,r 2 ) R 2 w (r 1 )R 2 nl (r 2 )r 2 1 drirldr 2 , (3.28) 

dTTCo Jo Jo 

where the angular parts are contained in the function 22 

p27T P7T p2tT P7T , 



J(ri,r 2 ) = 



Y± - 1) l 



I ,,, \ I a \ Ylrn ^ 2 , 02)1 

1 0 J o Jo Jo \ r i 2 r 2 ) 4tt (3.29) 

x sin Oi dOi dcfi sin 0 2 d0 2 d(f 2 . 



22 >00 (01,0l)«l /v^F. 



The calculation of this integral requires the expansion of 1/ r\ 2 in terms 
of spherical harmonics: 23 



1 

r\2 




47 r 

2kTl 



k 

q= — k 



(3.30) 



23y M 



for r 2 > r\ (and rq r 2 when rq > r 2 ). Only the term for k = 0 
survives in the integration over angles in eqn 3.29 to give 24 



J{ri,r 2 ) 



0 for n < r 2 , 

l/rq — l/r 2 for rq > r 2 . 



24 When k / 0 the integral of the func- 
tion Y k * (6 i,<fi) over 6h and f>\ equals 
zero. 



When n < r 2 the original screening argument applies and eqn 3.25 
gives a good description. When n > r 2 the appropriate potential is 
proportional to —2/r 2 — l/rq and J(r 1 , 7 * 2 ) accounts for the difference 
between this and — 2 /t*i — l/r 2 used in Ho a - Thus we find 



{!,, (n“^) 

(3.31) 

Evaluation of this integral for the ls2p configuration (in Exercise 3.6) 
gives Jis 2 p = —2.8 x 10 -2 eV — three orders of magnitude smaller than 
Jjjf 2 in eqn 3.7 (evaluated from eqn 3.24). The unperturbed wavefunc- 
tion for Z = 1 has energy equal to that of the corresponding level in 
hydrogen and the small negative direct integral accounts for the incom- 
pleteness of the screening of the n/-electron by the inner electron. 



3.3.3 Excited states: the exchange integral 

The exchange integral has the same form as eqn 3.16 but with 
rather than (and uf<r 2 as before). Within the spatial wavefunction 
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25 

At small r the wavefunction of an 
ns-electron deviates significantly from 
w ns =1 5 f° r thi s reason ls2p was chosen 
as an example above. 

26 

The overlap of the Is and nl wave- 
functions becomes smaller as n and 
l increase. In Heisenberg’s treatment 
where screening is taken into account, 
the direct integral gives the deviation 
from the hydrogenic levels (which could 
be characterised by a quantum de- 
fect as in the alkalis, see Chapter 4). 
For electrons with l / 0 the term 
h 2 l (l + 1 ) / 2 mr 2 in the Schrodinger 
equation causes the electron’s wave- 
function to lie almost entirely outside 
the region where uf~ 2 = R\ s (r) /\/47r 
has a significant value. 



u n im = </>) only the radial part depends on Z. We write 

the exchange integral as (cf. eqn 3.28) 



-^lsnZ — 



/ / K(n, r 2 )R ls (ri)Rni (n )R ls (r 2 )R n i {r 2 )r\ dn r\ dr 2 . 

(3.32) 

The function K(ri,r 2 ) containing the angular integrals is (cf. eqn 3.29) 



47reo 



K(n,r 2 ) = II JJ 

x sin 6>1 dOi d^i sin 0 2 d0 2 dfo • 



(3.33) 



For the lsnp configuration only the second term of the expansion in 
eqn 3.30, with k = 1, survives in the integration because of the orthog- 
onality of the spherical harmonic functions (see Exercise 3.7), to give 



K(ri,r 2 ) 



ri/3r 2 for v\ < r 2 , 
r 2 /3rl for r 2 < r \ . 



(3.34) 



Carrying out the integration over the radial wavefunctions in eqn 3.32 for 
the ls2p configuration gives the splitting between 3 P and X P as 2Ki s2p ~ 
0.21 eV (close to the measured value of 0.25 eV). 

The assumption that the excited electron lies outside the Is wave- 
function does not work so well for lsns configurations since ^ ns (0) has 
a finite value and the above method of calculating J and K is less 
accurate. 25 The ls2s configuration of helium has a singlet-triplet sepa- 
ration of E ( 1 S) — E ( 3 S) = 2Ki s2s ~ 0.80 eV and the direct integral is 
also larger than that for ls2p — these trends are evident in Fig. 3.4 (see 
also Exercise 3. 7). 26 

In some respects, helium is a more typical atom than hydrogen. The 
Schrodinger and Dirac equations can be solved exactly for the one- 
electron system, but not for helium or other atoms with more electrons. 
Thus in a careful study of helium we encounter the approximations 
needed to treat multi-electron atoms, and this is very important for 
understanding atomic structure in general. Helium also gives a good 
example of the influence of identical particles on the occupation of the 
states in quantum systems. The energy levels of the helium atom (and 
the existence of exchange integrals) do not depend on the fact that the 
two electrons are identical, as demonstrated in Exercises 3.3 and 3.4; 
however, this is a common point of confusion. The books recommended 
for further reading give clear and accurate descriptions of helium that 
reward careful study. 



Further reading 

The recommended books are divided into two categories corresponding 
to the two main themes in this chapter: (a) a description of how to 
calculate the electrostatic energy in an atom with more than one elec- 
tron, which introduces principles that can be used in atoms with more 
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electrons; and (b) a discussion of the influence of identical particles on 
the statistics of a quantum system that is important throughout physics. 
The influence of identical particles on the occupation of the quantum lev- 
els of a system with many particles, i.e. Bose-Einstein and Fermi-Dirac 
statistics, is discussed in statistical mechanics texts. Clear descriptions 
of helium may be found in the following textbooks: Cohen- Tannoudji et 
al. (1977), Woodgate (1980) and Mandl (1992). The calculation of the 
direct and exchange integrals in Section 3.3 is based on the definitive 
work by Bethe and Salpeter (1957), or see Bethe and Jackiw (1986). 

A very instructive comparison can be made between the properties of 
the two electrons in helium and the nuclear spin statistics of homonu- 
clear diatomic molecules 27 described in Atkins (1983, 1994). 28 There 
are diatomic molecules with nuclei that are identical bosons, identical 
fermions and cases of two similar but not identical particles, and their 
study gives a wider perspective than the study of helium alone. The 
nuclei of the two atoms in a hydrogen molecule are protons which are 
fermions (like the two electrons in helium). 29 For reasons explained in 
the above references, we can consider only those parts of the molecular 
wavefunction that describe the rotation x/j ro t and the nuclear spin states 
'ipi — these are spatial and spin wavefunctions, respectively. For H 2 the 
wavefunction must have overall antisymmetry with respect to an inter- 
change of particle labels since the nuclei are protons, each with a spin 
of 1/2. This requires that a rotational must pair with a spin function of 
the opposite symmetry: 



27 

Molecules made up of two atoms with 
identical nuclei. 

28 

These books also summarise the he- 
lium atom and the quantum mechan- 
ics of these molecular systems is very 
closely related to atomic physics. 

2q 

The wavefunction of the hydrogen 
molecule has exchange symmetry — 
crudely speaking, the molecule looks 
the same when rotated through 180 °. 



V’molecule = V’rotV’/ or • ( 3 - 35 ) 

This is analogous to eqn 3.18 for helium; as described in Section 3.2.1, 
the two spin- 1/2 nuclei in a hydrogen molecule give a total (nuclear) 
spin of 0 and 1, with one state and three states, respectively. The 1 to 3 
ratio of the number of nuclear spins associated with the energy levels for 
t and 'i/jA t , respectively, influences the populations of these rotational 
energy levels in a way that is directly observed in molecular spectra (the 
intensity of the lines in spectra depends on the population of the initial 
level). The molecule HD made from hydrogen and deuterium does not 
have identical nuclei so there is no overall symmetry requirement, but 
it has similar energy levels to those of H 2 apart from the mass depen- 
dence. This gives a real physical example where the statistics depends 
on whether the particles are identical or not, but the energy of the sys- 
tem does not. Exercise 3.4 discusses an artificial example: a helium-like 
system that has the same energy levels as a helium atom and hence the 
same direct and exchange integrals, even though the constituent parti- 
cles are not identical. 
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Exercises 

More advanced problems are indicated by a *. 



(3.1) Estimate of the binding energy of helium 

(a) Write down the Schrodinger equation for the 
helium atom and state the physical significance 
of each of the terms. 

(b) Estimate the equilibrium energy of an electron 
bound to a charge -\-Ze by minimising 



2 mr 2 47reo r 

(c) Calculate the repulsive energy between the two 
electrons in helium assuming that n 2 ~ r. 
Hence estimate the ionization energy of helium. 

(d) Estimate the energy required to remove a fur- 
ther electron from the helium- like ion Si 12+ , 
taking into account the scaling with Z of the 
energy levels and the expectation value for 
the electrostatic repulsion. The experimen- 
tal value is 2400 eV. Compare the accuracy of 
your estimates for Si 12+ and helium. (IE(He) 
= 24.6 eV.) 

(3.2) Direct and exchange integrals for an arbitrary 
system 

(a) Verify that for 

^ A (ri,r 2 ) 

~^= {u a (n)uf3(r2) - Uoc(r2)up(ri) } 
v 2 

and H' — e 2 /47reo^i2 the expectation value 
(^ A \ H' |'0 A ) has the form J — K and give the 
expressions for J and K. 

(b) Write down the wavefunction '0 s that is orthog- 
onal to fj A . 

(c) Verify that (^ A \ H' |^ s ) = 0 so that H' is di- 
agonal in this basis. 

(3.3) Exchange integrals for a delta-function interaction 
A particle in a square- well potential, with V(x) =0 
for 0 < x < £ and V(x) — oo elsewhere, has 
normalised eigenfunctions uo(x) = \j2/i sin (jrx/ E) 
and ui(x) = yj 2/t sin (27 nr/ £). 

(a) What are the eigenenergies Eq and E\ of these 
two wavefunctions for a particle of mass ml 



(b) Two particles of the same mass m are both in 
the ground state so that the energy of the whole 
system is 2Eq. Calculate the perturbation pro- 
duced by a point- like interaction described by 
the potential ad (x i — x 2 ), with a constant. 

(c) Show that, when the two interacting particles 
occupy the ground and first excited states, the 
direct and exchange integrals are equal. Also 
show that the delta-function interaction pro- 
duces no energy shift for the antisymmetric spa- 
tial wavefunction and explain this in terms of 
correlation of the particles. Calculate the en- 
ergy of the other level of the perturbed system. 

(d) For the two energy levels found in part (c), 
sketch the spatial wavefunction as a function 
of the coordinates of the two particles x\ and 
X 2 - The particles move in one dimension but 
the two-particle wavefunction exists in a two- 
dimensional Hilbert space — draw either a con- 
tour plot in the xix 2 -plane or attempt a three- 
dimensional sketch (by hand or computer). 

(e) The two particles are identical and have spin 
1/2. What is the total spin quantum number S 
associated with each of the energy levels found 
in part (c)? 

*(f) Discuss qualitatively the energy levels of this 
system for two particles that have slightly dif- 
ferent masses m± ^ m 2 , so that they are distin- 
guishable? [Hint. The spin has not been given 
because it is not important for non-identical 
particles.] 

Comment. The antisymmetric spatial wavefunction 
in part (c) clearly has different properties from a 
straightforward product uqU\. The exchange inte- 
gral is a manifestation of the entanglement of the 
multiple-particle system. 

(3.4) A helium-like system with non-identical particles 
Imagine that there exists an exotic particle with the 
same mass and charge as the electron but spin 3/2 
(so it is not identical to the electron). This par- 
ticle and an electron form a bound system with a 
helium nucleus. Compare the energy levels of this 
system with those of the helium atom. Describe 
the energy levels of a system with two of the ex- 
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otic particles bound to a helium nucleus (and no 
electrons). [Hint. It is not necessary to specify the 
values of total spin associated with the levels.] 

(3.5) The integrals in helium 

(a) Show that the integral in eqn 3.24 gives the 
value stated in eqn 3.7. 

(b) Estimate the ground-state energy of helium us- 
ing the variational principle. (The details of 
this technique are not given in this book; see 
the section on further reading.) 

(3.6) Calculation of integrals for the ls2p configuration 

(a) Draw a scale diagram of i?f s =2 (r), R^r 1 (r) 
and R f p =1 (r). (See Table 2.2.) 

(b) Calculate the direct integral in eqn 3.31 and 
show that it gives 

T _ e 2 /47re 0 13 

ls2p ” 2^55 * 

Give the numerical value in eV (cf. that given 
in the text). 

(3.7) Expansion of 1 / 7*12 

For 7*1 < 7*2 the binomial expansion of 

— - = (rf + r\ - 2 nr 2 cos 0 12 ) 1/2 

7*12 V ' 



is 



1 

7*12 



1_ 

7*2 



— 2 — COS 012 + 
7*2 




2 



- 1/2 



_ 1 _ 

7*2 




COS 012 T • • • 

7*2 



(3.36) 



(When 7*1 > 7*2 we must interchange 7*1 and 7*2 to ob- 
tain convergence.) The cosine of the angle between 
1*1 and r 2 is 



COS 012 = ri • 1*2 

= cos 0i cos 02 + sin 0i sin O 2 cos (0i — $ 2 ) ■ 

(a) Show that the first two terms in the binomial 
expansion agree with the terms with k — 0 and 
1 in eqn 3.30. 

(b) The repulsion between a Is- and an 7iZ-electron 
is independent of m. Explain why, physically 
or mathematically. 

(c) Show that eqn 3.32 leads to eqn 3.34 for l = 1. 

(d) For a lsnl configuration, the quantity K(ri, 7*2) 
in eqn 3.34 is proportional to ri/r 1 ^ 1 when 
7*1 < 7*2. Explain this in terms of mathemat- 
ical properties of the Y^ m functions. 



Web site: 

http : //www . physics . ox . ac . uk/users/f oot 

This site has answers to some of the exercises, corrections and other supplementary information. 
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1 Also referred to by its original German 
name as the Aufbauprinciple. An ex- 
tensive discussion of the atomic struc- 
ture that underlies the periodic table 
can be found in chemistry texts such as 
Atkins (1994). 

o 

Most of the arrangement of elements 
in a periodic table was determined by 
chemists, such as Mendeleev, in the 
nineteenth century. A few inconsisten- 
cies in the ordering were resolved by 
Moseley’s measurements of X-ray spec- 
tra (see Chapter 1). 

3 

The configuration of an atom is spec- 
ified by a list of nl with the occupancy 
as an exponent. Generally, we do not 
need to list the full configuration and it 
is sufficient to say that a sodium atom 
in its ground state has the configura- 
tion 3s. A sodium ‘atom’ with one elec- 
tron in the 3s level, and no others, is 
an excited state of the highly-charged 
ion Na +1 ° — this esoteric system can be 
produced in the laboratory but confu- 
sion with the common sodium atom is 
unlikely. 



The alkalis 



4.1 Shell structure and the periodic table 

For multi-electron atoms we cannot solve the Hamiltonian analytically, 
but by making appropriate approximations we can explain their struc- 
ture in a physically meaningful way. To do this, we start by considering 
the elementary ideas of atomic structure underlying the periodic table 
of the elements. In the ground states of atoms the electrons have the 
configuration that minimises the energy of the whole system. The elec- 
trons do not all fall down into the lowest orbital with n — 1 (the K-shell) 
because the Pauli exclusion principle restricts the number of electrons 
in a given (sub-) shell — two electrons cannot have the same set of quan- 
tum numbers. This leads to the ‘building-up’ principle: electrons fill up 
higher and higher shells as the atomic number Z increases across the 
periodic table. 1 Full shells are found at atomic numbers Z = 2, 10, . . . 
corresponding to helium and the other inert gases. These inert gases, in 
a column on the right-hand side of the periodic table (see inside front 
cover) , were originally grouped together because of their similar chemical 
properties, i.e. the difficulty in removing an electron from closed shells 
means that they do not readily undergo chemical reactions. 2 However, 
inert gas atoms can be excited to higher-lying configurations by bom- 
bardment with electrons in a gas discharge, and such processes are very 
important in atomic and laser physics, as in the helium-neon laser. 

The ground states of the alkalis have the following electronic con- 
figurations: 3 



lithium 


Li 


Is 2 2s, 


sodium 


Na 


ls 2 2s 2 2p 6 3s , 


potassium 


K 


ls 2 2s 2 2p 6 3s 2 3p 6 4s , 


rubidium 


Rb 


ls 2 2s 2 2p 6 3s 2 3p 6 3d 10 4s 2 4p 6 5s , 


caesium 


Cs 


ls 2 2s 2 2p 6 3s 2 3p 6 3d 10 4s 2 4p 6 4d 10 5s 2 5p 6 6s 



The alert reader will notice that the sub-shells of the heavier alkalis 
are not filled in the same order as the hydrogenic energy levels, e.g. elec- 
trons occupy the 4s level in potassium before the 3d level (for reasons 
that emerge later in this chapter). Thus, strictly speaking, we should 
say that the inert gases have full sub-shells, e.g. argon has the electronic 
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Table 4.1 Ionization energies of the inert gases and alkalis. 



Element 


Z 


IE (eV) 


He 


2 


24.6 


Li 


3 


5.4 


Ne 


10 


21.6 


Na 


11 


5.1 


Ar 


18 


15.8 


K 


19 


4.3 


Kr 


36 


14.0 


Rb 


37 


4.2 


Xe 


54 


12.1 


Cs 


55 


3.9 



configuration ls 2 2s 2 2p 6 3s 2 3p 6 with the 3d sub-shell unoccupied. 4 

Each alkali metal comes next to an inert gas in the periodic table 
and much of the chemistry of the alkalis can be explained by the simple 
picture of their atoms as having a single unpaired electron outside a core 
of closed electronic sub-shells surrounding the nucleus. The unpaired 
valence electron determines the chemical bonding properties; since it 
takes less energy to remove this outer electron than to pull an electron 
out of a closed sub-shell (see Table 4.1), thus the alkalis can form singly- 
charged positive ions and are chemically reactive. 5 However, we need 
more than this simple picture to explain the details of the spectra of the 
alkalis and in the following we shall consider the wavefunctions. 



4.2 The quantum defect 

The energy of an electron in the potential proportional to 1/r depends 
only on its principal quantum number n, e.g. in hydrogen the 3s, 3p and 
3d configurations all have the same gross energy. These three levels are 
not degenerate in sodium, or any atom with more than one electron, 
and this section explains why. Figure 4.1 shows the probability density 
of 3s-, 3p- and 3d-electrons in sodium. The wavefunctions in sodium 
have a similar shape (number of nodes) to those in hydrogen. The 3d 
wavefunction has a single lobe outside the core so that it experiences 
almost the same potential as in a hydrogen atom; therefore this electron, 
and other d configurations in sodium with n > 3, have binding energies 
similar to those in hydrogen, as shown in Fig. 4.2. In contrast, the 
wavefunctions for the s-electrons have a significant value at small r — 
they penetrate inside the core and ‘see’ more of the nuclear charge. Thus 
the screening of the nuclear charge by the other electrons in the atom is 
less effective for ns configurations than for nd, and s-electrons have lower 
energy than d-electrons with the same principal quantum number. (The 
np-electrons he between these two. 6 ) The following modified form of 



4 This book takes a shell to be all energy 
levels of the same principal quantum 
number n, but the meaning of shell and 
sub-shell may be different elsewhere. 
We use sub-shell to denote all energy 
levels with specific values of n and l 
(in a shell with a given value of n). 
We used these definitions in Chapter 1; 
the inner atomic electrons involved in 
X-ray transitions follow the hydrogenic 
ordering. 

5 For a plot of the ionization energies of 
all the elements see Grant and Phillips 
(2001, Chapter 11, Fig. 18). This 
figure is accessible at http://www. 
oup . co . uk/best . textbooks/physics/ 
ephys/ illustrations/. 



6 This dependence of the energy on the 
quantum number l can also be ex- 
plained in terms of the elliptical or- 
bits of Bohr-Sommerfeld quantum the- 
ory rather than Schrodinger’s wave- 
functions; however, we shall use only 
the ‘p r °per’ wavefunction description 
since the detailed correspondence be- 
tween the elliptical classical trajectories 
and the radial wavefunctions can lead 
to confusion. 



62 The alkalis 



Fig. 4.1 The probability density of the 
electrons in a sodium atom as a func- 
tion of r. The electrons in the n = 1 
and n = 2 shells make up the core, 
and the probability density of the un- 
paired outer electron is shown for the 
n = 3 shell with 1 = 0,1 and 2. The 
probability is proportional to |P(r)| 2 = 
r 2 |P(r)| 2 ; the r 2 factor accounts for the 
increase in volume of the spherical shell 
between r and r + dr (i.e. 47rr 2 dr) as 
the radial distance increases. The de- 
creasing penetration of the core as l 
increases can be seen clearly — the 3d- 
electron lies mostly outside the core 
with a wavefunction and binding en- 
ergy very similar to those for the 3d 
configuration in hydrogen. These wave- 
functions could be calculated by the 
simple numerical method described in 
Exercise 4.10, making the ‘frozen core’ 
approximation, i.e. that the distribu- 
tion of the electrons in the core is 
not affected by the outer electron — this 
gives sufficient accuracy to illustrate 
the qualitative features. (The iterative 
method described in Section 4.4 could 
be used to obtain more accurate numer- 
ical wavefunctions.) 




Bohr’s formula works amazingly well for the energy levels of the alkalis: 



E(n,l) = -he — 

(n - SO 



(4.1) 



7 

This differs from the modification 
used for X-ray transitions in Chapter 
1 — hardly surprising since the physical 
situation is completely different for the 
inner and outer electrons. 



A quantity 5i , called the quantum defect, is subtracted from the prin- 
cipal quantum number to give an effective principal quantum number 
n* = n — Si . 7 The values of the quantum defects for each l can be esti- 
mated by inspecting the energy levels shown in Fig. 4.2. The d-electrons 
have a very small quantum defect, 5 a ~ 0, since their energies are nearly 
hydrogenic. We can see that the 3p configuration in sodium has com- 
parable energy to the n = 2 shell in hydrogen, and similarly for 4p and 
n = 3, etc.; thus <S P ~ 1. It is also clear that the quantum defect for 
s-electrons is greater than that for p-electrons. A more detailed analysis 
shows that all the energy levels of sodium can be parametrised by the 
above formula and only three quantum defects: 
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E (eV) 
0— r 



Sodium H 

ns np nd nf n 




6 



5 



- 1 - 



-2 - 



4 



3 



-3- 




-4- 



— 2 



-5- 



- 6— 1 



Fig. 4.2 The energies of the s, p, d and 
f configurations in sodium. The energy 
levels of hydrogen are marked on the 
right for comparison. The guidelines 
link configurations with the same n to 
show how the energies become closer to 
the hydrogenic values as l increases, i.e. 
the quantum defects decrease so that 
Si ~ 0 for f-electrons (and for the con- 
figurations with l > 3 that have not 
been drawn). 



5 S = 1.35 , £ p = 0.86 , 5d = 0.01, ft ~ 0.00 for l> 2. 

There is a small variation with n (see Exercise 4.3). Having examined 
the variation in the quantum defects with orbital angular momentum 
quantum number for a given element, now let us compare the quantum 
defects in different alkalis. The data in Table 4.1 show that the alkalis 
have similar ionization energies despite the variation in atomic number. 
Thus the effective principal quantum numbers n* = (13.6 eV/IE) 1 ^ 2 
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Table 4.2 The effective principal quantum numbers and quantum defects for the 
ground configuration of the alkalis. Note that the quantum defects do depend slightly 
on n (see Exercise 4.3), so the value given in this table for the 3s-electron in sodium 
differs slightly from the value given in the text ( S s = 1.35) that applies for n > 5. 



Element 


Configuration 


n* 


5s 


Li 


2s 


1.59 


0.41 


Na 


3s 


1.63 


1.37 


K 


4s 


1.77 


2.23 


Rb 


5s 


1.81 


3.19 


Cs 


6s 


1.87 


4.13 



(from eqn 4.1) are remarkably similar for all the ground configurations 
of the alkalis, as shown in Table 4.2. 

In potassium the lowering of the energy for the s-electrons leads to the 
4s sub-shell filling before 3d. By caesium (spelt cesium in the US) the 
6s configuration has lower energy than 4f (5f ~ 0 for Cs) . The exercises 
give other examples, and quantum defects are tabulated in Kuhn (1969) 
and Woodgate (1980), amongst others. 



4.3 The central- field approximation 



The previous section showed that the modification of Bohr’s formula by 
the quantum defects gives reasonably accurate values for the energies 
of the levels in alkalis. We described an alkali metal atom as a single 
electron orbiting around a core with a net charge of +le, i.e. the nucleus 
surrounded by N — 1 electrons. This is a top-down approach where we 
consider just the energy required to remove the valence electron from 
the rest of the atom; this binding energy is equivalent to the ionization 
energy of the atom. In this section we start from the bottom up and 
consider the energy of all the electrons. The Hamiltonian for N electrons 
in the Coulomb potential of a charge -\-Ze is 



N 



i = 1 




Ze 2 / 47reo 
n 



e2 / 4 ^p 1 

j>i ^ ) 



(4.2) 



8 

For example, lithium has three inter- 
actions between the three electrons, in- 
versely proportional to 7 * 12 , ^13 and 7 * 23 ; 
summing over all j for each value of i 
would give six terms. 



The first two terms are the kinetic energy and potential energy for each 
electron in the Coulomb field of a nucleus of charge Z. The term with 
rij = \n — rj\ in the denominator is the electrostatic repulsion between 
the two electrons at and Yj. The sum is taken over all electrons with 
j > i to avoid double counting. 8 This electrostatic repulsion is too large 
to be treated as a perturbation; indeed, at large distances the repulsion 
cancels out most of the attraction to the nucleus. To proceed further 
we make the physically reasonable assumption that a large part of the 
repulsion between the electrons can be treated as a central potential 
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S (r) . This follows because the closed sub-shells within the core have 
a spherical charge distribution, and therefore the interactions between 
the different shells and between shells and the valence electron are also 
spherically symmetric. In this central-field approximation the total po- 
tential energy depends only on the radial coordinate: 

V GF (r) = -^^+S(r). (4.3) 

In this approximation the Hamiltonian becomes 

^CF = E{-£ V ? + y CF(r i )}. (4.4) 

i—1 ^ ' 

For this form of potential, the Schrodinger equation for N electrons, 
Hip = E atom ip, can be separated into N one-electron equations, i.e. 
writing the total wavefunction as a product of single-electron wavefunc- 
tions, namely 

^atom = •■•IpN, (4-5) 

leads to N equations of the form 

(-^-V? + VcF(ri)}vi=^i, (4-6) 



and similar for electrons i = 2 to N. This assumes that all the elec- 
trons see the same potential, which is not as obvious as it may appear. 
This symmetric wavefunction is useful to start with (cf. the treatment 
of helium before including the effects of exchange symmetry); however, 
we know that the overall wavefunction for electrons, including spin, 
should be antisymmetric with respect to an interchange of the particle 
labels. (Proper antisymmetric wavefunctions are used in the Hartree- 
Fock method mentioned later in this chapter.) The total energy of the 
system is E atom = E\ + + . . . + E^. The Schrodinger equations for 

each electron (eqn 4.6) can be separated into parts to give wavefunctions 
of the form ipi = R(ri)Yi 1:rni ip sp i n (l). Angular momentum is conserved 
in a central field and the angular equation gives the standard orbital 
angular momentum wavefunctions, as in hydrogen. In the radial equa- 
tion, however, we have Vcf(^) rather than a potential proportional to 
1/r and so the equation for P (r) = rR (r) is 



{ 



Ail 

2 m dr 2 



+ Vcf (r) + 



h 2 l(l + l) 
2 mr 2 



P(r) = EP(r ) . 



(4.7) 



To solve this equation we need to know the form of Vcf M and compute 
the wavefunctions numerically. However, we can learn a lot about the 
behaviour of the system by thinking about the form of the solutions, 
without actually getting bogged down in the technicalities of solving the 
equations. At small distances the electrons experience the full nuclear 
charge so that the central electric field is 

Ze ^ 

Aire^r 2 



E(r) 



(4.8) 
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Fig. 4.3 The change-over from the short- to the long-range is not calculated but is 
drawn to be a reasonable guess, using the following criteria. The typical radius of 
the Is wavefunction around the nucleus of charge +Ze = +lle is about ao/11 , and 
so Z e ff will start to drop at this distance. We know that Z e ^ ~ 1 at the distance 
at which the 3d wavefunction has appreciable probability since that eigenstate has 
nearly the same energy as in hydrogen. The form of the function Z e ff (r) can be 
found quantitatively by the Thomas-Fermi method described in Woodgate (1980). 



At large distances the other N — 1 electrons screen most of the nuclear 
charge so that the field is equivalent to that of charge Tie: 

E(r) — > 6 2 r . (4.9) 

v ' 4n e 0 r 2 v ’ 

These two limits can be incorporated in a central field of the form 

(4io> 



^ This is not necessarily the best way to 
parametrise the problem for numerical 
calculations but it is useful for under- 
standing the underlying physical prin- 
ciples. 



The effective atomic number Z e Q (r) has limiting values of Z e fp (0) = Z 
and (r) — > 1 as r — > oo, as sketched in Fig. 4. 3. 9 The potential 
energy of an electron in the central held is obtained by integrating from 
infinity: 

V C F(r)=ef lEcF^OIdr'. (4.11) 

J oo 

The form of this potential is shown in Fig. 4.4. 

So far, in our discussion of the sodium atom in terms of the wave- 
function of the valence electron in a central held we have neglected 



Fig. 4.4 The form of the potential en- 
ergy of an electron in the central-field 
approximation (e^ = e 2 /47reo)- This 
approximate sketch for a sodium atom 
shows that the potential energy crosses 
over from Vcf ( r) = ~ e M/ r a t l° n g 
range to -lle^/r + Voffset; the con- 
stant ^offset comes from the integration 
in eqn 4.11 (if Z e ^{r) = 11 for all r then 
Offset — 0 but this is not the case). For 
electrons with l > 0 the effective poten- 
tial should also include the term that 
arises from the angular momentum, as 
shown in Fig. 4.5. 




4.3 
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the fact that the central field itself depends on the configuration of the 
electrons in the atom. For a more accurate description we must take 
into account the effect of the outer electron on the other electrons, and 
hence on the central field. The energy of the whole atom is the sum of 
the energies of the individual electrons (in eqn 4.6), e.g. a sodium atom in 
the 3s configuration has energy E (ls 2 2s 2 2p 6 3s) = 2£ , i s -h2^2 S + 6£ , 2 P + 
= E core + E 3s . This is the energy of the neutral atom relative to the 
bare nucleus (Na 11+ ). 10 It is more useful to measure the binding energy 
relative to the singly-charged ion (Na + ) with energy E (ls 2 2s 2 2p 6 ) = 
2E' ls +2E' 2s +6E' 2p = E ' c ore . The dashes are significant — the ten electrons 
in the ion and the ten electrons in the core of the atom have slightly 
different binding energies because the central field is not the same in 
the two cases. The ionization energy is IE = E atom — Eion = (Ecore — 
E^ore ) + -£/3 s • From the viewpoint of valence electrons, the difference 
in E core between the neutral atom and the ion can be attributed to 
core polarization, i.e. a change in the distribution of charge in the core 
produced by the valence electron. 11 To calculate the energy of multi- 



Fig. 4.5 The total potential in the 
central-field approximation including 
the term that is proportional to l (l + 
1 )/r 2 drawn here for l = 2 and the 
same approximate electrostatic Vcf( 0 
as shown in Fig. 4.4. The angular 
momentum leads to a ‘centrifugal bar- 
rier’ that tends to keep the wavefunc- 
tions of electrons with l > 0 away from 
r = 0 where the central-field potential 
is deepest. 



1( ^This is a crude approximation, espe- 
cially for inner electrons. 



11 This effect is small in the alkalis and 
it is reasonable to use the frozen core 
approximation that assumes E core ~ 
E ' c ore . This approximation becomes 

more accurate for a valence electron in 
higher levels where the influence on the 
core becomes smaller. 
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12 

This is in contrast to an infinite 
square well where confinement to a re- 
gion of fixed dimensions gives energies 
proportional to n 2 , where n is an inte- 
ger. 



electron atoms properly we should consider the energy of the whole 
system rather than focusing attention on only the valence electron. For 
example, neon has the ground configuration ls 2 2s 2 2p 6 and the electric 
field changes significantly when an electron is excited out of the 2p sub- 
shell, e.g. into the ls 2 2s 2 2p 5 3s configuration. 

Quantum defects can be considered simply as empirical quantities that 
happen to give a good way of parametrising the energies of the alkalis but 
there is a physical reason for the form of eqn 4.1. In any potential that 
tends to 1/r at long range the levels of bound states bunch together 
as the energy increases — at the top of the well the classically allowed 
region gets larger and so the intervals between the eigenenergies and the 
stationary solutions get smaller. 12 More quantitatively, in Exercise 1.12 
it was shown, using the correspondence principle, that such a potential 
has energies E oc 1/fc 2 , with A k = 1 between energy levels, but k 
is not itself necessarily an integer. For the special case of a potential 
proportional to 1/r for all distances, k is an integer that we call the 
principal quantum number n and the lowest energy level turns out to 
be n = 1. For a general potential in the central- field approximation we 
have seen that it is convenient to write k in terms of the integer n as 
k = n — S, where S is a non-integer (quantum defect). To find the actual 
energy levels of an alkali and hence S (for a given value of /) requires the 
numerical calculation of the wavefunctions, as outlined in the following 
section. 



4.4 Numerical solution of the Schrodinger 
equation 

Before describing particular methods of solution, let us look at the gen- 
eral features of the wavefunction for particles in potential wells. The 
radial equation for P (r) has the form 

^ = -f?{E-V(r)}P, (4.12) 

where the potential V (r) includes the angular momentum term in eqn 
4.7. Classically, the particle is confined to the region where E—V(r ) > 0 
since the kinetic energy must be positive. The positions where E = V (r) 
are the classical turning points where the particle instantaneously comes 
to rest, cf. at the ends of the swing of a pendulum. The quantum 
wavefunctions are oscillatory in the classically allowed region, with the 
curvature and number of nodes both increasing as E — V (r) increases, 
as shown in Fig. 4.6. The wavefunctions penetrate some way into the 
classically forbidden region where E — V (r) <0; but in this region the 
solutions decay exponentially and the probability falls off rapidly. 

How can we find P (r) in eqn 4.12 without knowing the potential V (r)? 
The answer is firstly to find the wavefunctions for a potential Vcf(^) 
that is ‘a reasonable guess’, consistent with eqn 4.11 and the limits on 
the central electric field in the previous equations. Then, secondly, we 
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Arbitrary 

units 




Fig. 4.6 The potential in the central- 
field approximation including the term 
that is proportional to l{l + l)/r 2 is 
drawn here for l = 2 and the same 
approximate electrostatic Vqf ( r ) as 
shown in Fig. 4.4. The function P(r) = 
rR(r) was drawn for n = 6 and l = 2 
using the method described in Exer- 
cise 4.10. 



make the assumed potential correspond closely to the real potential, as 
described in the next section. Equation 4.12 is a second-order differen- 
tial equation and we can numerically calculate P(r), the value of the 
function at r, from two nearby values, e.g. u(r — Sr) and u(r — 2 Sr). 13 
Thus, working from near r = 0, the method gives the numerical value of 
the function at all points going out as far as is necessary. The region of 
the calculation needs to extend beyond the classical turning point (s) by 
an amount that depends on the energy of the wavefunction being calcu- 
lated. These general features are clearly seen in the plots produced in 
Exercise 4.10. Actually, that exercise describes a method of finding the 
radial wavefunction R(r) rather than P (r) = rR (r) but similar princi- 
ples apply. 14 If you carry out the exercise you will find that the behaviour 
at large r depends very sensitively on the energy E — the wavefunction 
diverges if E is not an eigenenergy of the potential — this gives a way 
of searching for those eigenenergies. If the wavefunction diverges up- 
wards for E' and downwards for E" then we know that an eigenenergy 
of the system lies between these two values, E' < E^ < E" . Test- 
ing further values between these upper and lower bounds narrows the 
range and gives a more precise value of Ek (as in the Newton-Raphson 
method for finding roots). This so-called ‘shooting’ method is the least 
sophisticated method of computing wavefunctions and energies, but it 
is adequate for illustrating the principles of such calculations. Results 
are not given here since they can readily be calculated — the reader is 
strongly encouraged to implement the numerical method of solution, us- 
ing a spreadsheet program, as described in Exercise 4.10. This shows 
how to find the wavefunctions for an electron in an arbitrary potential 
and verifies that the energy levels obey a quantum defect formula such 
as eqn 4.1 in any potential that is proportional to 1/r at long range (see 
Fig. 4.7). 



1 3 

The step size Sr must be small com- 
pared to the distance over which the 
wavefunction varies; but the number of 
steps must not be so large that round- 
off errors begin to dominate. 



14 In a numerical method there is no 
reason why we should not calculate the 
wavefunction directly; P (r) was intro- 
duced to make the equations neater in 
the analytical approach. 
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Fig. 4.7 Simple modifications of the 
potential energy that could be used 
for the numerical solution of the 
Schrodinger equation described in Ex- 
ercise 4.10. For all these potentials 
V{r) = -e 2 /4:7Teor for r ^ r co re - (a) 
Inside the radial distance r co re the po- 
tential energy is V(r) = —Ze 2 /4:7reor + 
^offset 5 drawn here for Z = 3 and an 
offset chosen so that V (r) is continuous 
at r = r C ore • This corresponds to the 
situation where the charge of the core is 
an infinitely thin shell. The deep poten- 
tial in the inner region means that the 
wavefunction has a high curvature, so 
small steps must be used in the numeri- 
cal calculation (in this region). The hy- 
pothetical potentials in (b) and (c) are 
useful for testing the numerical method 
and for showing why the eigenenergies 
of any potential proportional to 1/r at 
long range obey a quantum defect for- 
mula (like eqn 4.1). The form of the 
solution depends sensitively on the en- 
ergy in the outer region r ^ r C ore, but 
in the inner region where \E\ <C |F(r)| 
it does not, e.g. the number of nodes 
(‘wiggles’) in this region changes slowly 
with energy E. Thus, broadly speak- 
ing, the problem reduces to finding the 
wavefunction in the outer region that 
matches boundary conditions, at r = 
r CO re, that are almost independent of 
the energy — the potential energy curve 
shown in (b) is an extreme example 
that gives useful insight into the be- 
haviour of the wavefunction for more 
realistic central fields. 



(a) 




4.4.1 Self-consistent solutions 

The numerical method described above, or a more sophisticated one, 
can be used to find the wavefunctions and energies for a given potential 
in the central-field approximation. Now we shall think about how to de- 
termine Vcf itself. The potential of the central field in eqn 4.2 includes 
the electrostatic repulsion of the electrons. To calculate this mutual 
repulsion we need to know where the electrons are, i.e. their wavefunc- 
tions, but to find the wavefunctions we need to know the potential. This 
argument is circular. However, going round and round this loop can be 
useful in the following sense. As stated above, the method starts by 
making a reasonable estimate of Vcf and then computing the electronic 
wavefunctions for this potential. These wavefunctions are then used to 
calculate a new average potential (using the central-field approximation) 
that is more realistic than the initial guess. This improved potential is 
then used to calculate more accurate wavefunctions, and so on. On sue- 
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cessive iterations, the changes in the potential and wavefunctions should 
get smaller and converge to a self-consistent solution, i.e. where the wave- 
functions give a certain Vcf(^), and solving the radial equation for that 
central potential gives back the same wavefunctions (within the required 
precision). 15 This self-consistent method was devised by Hartree. How- 
ever, the wavefunctions of multi-electron atoms are not simply products 
of individual wavefunctions as in eqn 4.5. In our treatment of the excited 
configurations of helium we found that the two-electron wavefunctions 
had to be antisymmetric with respect to the permutation of the electron 
labels. This symmetry requirement for identical fermions was met by 
constructing symmetrised wavefunctions that were linear combinations 
of the simple product states (i.e. the spatial part of these functions is 
V^space an d V^space) • A convenient way to extend this symmetrisation to 
N particles is to write the wavefunction as a Slater determinant: 





V’a(l) 


V’a (2) 


• Vv(A0 


'f = — u 

Vn 


V> bO- ) 


^(2) 


• V> 6 (AO 


V’c(l) 


V’c (2) 


■ V’c (AO 


V’x ( 1 ) 


V’x (2) •• 


• V’x (AO 



Here a, 6, c, . . . , x are the possible sets of quantum numbers of the in- 
dividual electrons, 16 and 1, 2, . . . , TV are the electron labels. The change 
of sign of a determinant on the interchange of two columns makes the 
wavefunction antisymmetric. The Hartree-Fock method uses such sym- 
metrised wavefunctions for self-consistent calculations and nowadays this 
is the standard way of computing wavefunctions, as described in Brans- 
den and Joachain (2003). In practice, numerical methods need to be 
adapted to the particular problem being considered, e.g. numerical val- 
ues of the radial wavefunctions that give accurate energies may not give 
a good value for a quantity such as the expectation value (l/r 3 ) that is 
very sensitive to the behaviour at short range. 



4.5 The spin— orbit interaction: a quantum 
mechanical approach 

The spin-orbit interaction /3s • 1 (see eqn 2.49) splits the energy levels to 
give fine structure. For the single valence in an alkali we could treat this 
interaction in exactly the same way as for hydrogen in Chapter 2, i.e. use 
the vector model that treats the angular momenta as vectors obeying 
classical mechanics (supplemented with rules such as the restriction of 
the angular momentum to integer or half-integer values). However, in 
this chapter we shall use a quantum mechanical treatment and regard the 
vector model as a useful physical picture that illustrates the behaviour 
of the quantum mechanical operators. The previous discussion of fine 
structure in terms of the vector model had two steps that require further 
justification. 
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15 The number of iterations required, 
before the changes when going round 
the loop become very small, depends 
on how well the initial potential is cho- 
sen, but the final self-consistent solu- 
tion should not depend on the initial 
choice. In general, it is better to let a 
computer do the work rather than ex- 
pend a lot of effort improving the start- 
ing point. 



16 Including both space and spin. 
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1 7 

The wavefunction for an alkali metal 
atom in the central-field approxima- 
tion is a product of a radial wavefunc- 
tion (which does not have an analyti- 
cal expression) and angular momentum 
eigenfunctions (as in hydrogen). 

1 8 

More explicitly, we have 
| lmism s ) = Yz,rrq V’spin, where 

V’spin = | m s = +1/2) or I m s = -1/2). 

19 Proof of these commutation re- 
lations: [s x lx + Syly + S Z l Z , l Z \ — 

s x [lx,lz\-\~Sy [/ydd — is X ly + ISy lx 7^ 
0. Similarly, [s x l x + s y l y + s z l z , s z ] = 
— i s y l x + i s x l y ^ 0. Note that 

[s • 1, l z \ = — [s • 1, s z ] and hence s • 1 
commutes with l z + s z . 




Fig. 4.8 The total angular momen- 
tum of the atom j = 1 + s is a fixed 
quantity in the absence of an exter- 
nal torque. Thus an interaction be- 
tween the spin and orbital angular mo- 
menta /3s • 1 causes these vectors to ro- 
tate (precess) around the direction of j 
as shown. 

20 

As for helium in Section 3.2 and in 
the classical treatment of the normal 
Zeeman effect in Section 1.8. 

21 

These commutation relations for the 
operators correspond to the conserva- 
tion of the total angular momentum, 
and its component along the z- axis. 
Only an external torque on the atom 
affects these quantities. The spin-orbit 
interaction is an internal interaction. 



(a) The possible values of the total angular momentum obtained by 
the addition of the electron’s spin, s = 1/2, and its orbital angular 
momentum are j = / + l/2or/ — 1/2. This is a consequence of the 
rules for the addition of angular momentum in quantum mechanics 
(vector addition but with the resultant quantised). 

(b) The vectors have squared magnitudes given by j 2 = j(j + 1), l 2 = 
/(/ + 1) and s 2 = 3/4, where j and l are the relevant angular mo- 
mentum quantum numbers. 

Step (b) arises from taking the expectation values of the quantum op- 
erators in the Hamiltonian for the spin-orbit interaction. This is not 
straightforward since the atomic wavefunctions R(r ) 1 1 mi sm s ) are not 
eigenstates of this operator 17 — this means that we must face the com- 
plications of degenerate perturbation theory. This situation arises fre- 
quently in atomic physics and merits a careful discussion. 

We wish to determine the effect of an interaction of the form s • 1 
on the angular eigenfunctions \lmi sm s ). These are eigenstates of the 
operators l 2 , l z , s 2 and s z labelled by the respective eigenvalues. 18 There 
are 2(21 + 1) degenerate eigenstates for each value of l because the energy 
does not depend on the orientation of the atom in space, or the direction 
of its spin, i.e. energy is independent of mi and m s . The states | Imi sm s ) 
are not eigenstates of s • 1 because this operator does not commute with 
l z and s z : [s • 1, l z \ 0 and [s • 1, s z \ 0. 19 Quantum operators only 

have simultaneous eigenfunctions if they commute. Since \lmism s ) is 
an eigenstate of l z it cannot simultaneously be an eigenstate of s • 1, and 
similarly for s z . However, s-1 does commute with l 2 and s 2 : [s • 1, l 2 ] = 0 
and [s • 1, s 2 ] = 0 (which are easy to prove since s x , s y: s z , l x , l y and 
l z all commute with s 2 and l 2 ). So / and s are good quantum numbers 
in fine structure. Good quantum numbers correspond to constants of 
motion in classical mechanics — the magnitudes of 1 and s are constant 
but the orientations of these vectors change because of their mutual 
interaction, as shown in Fig. 4.8. If we try to evaluate the expectation 
value using wavefunctions that are not eigenstates of the operator then 
things get complicated. We would find that the wavefunctions are mixed 
by the perturbation, i.e. in the matrix formulation of quantum mechanics 
the matrix representing the spin-orbit interaction in this basis has off- 
diagonal elements. The matrix could be diagonalised by following the 
standard procedure for finding the eigenvalues and eigenvectors, 20 but 
a p-electron gives six degenerate states so the direct approach would 
require the diagonalisation of a 6 x 6 matrix. It is much better to find the 
eigenfunctions at the outset and work in the appropriate eigenbasis. This 
dook-before-you-leap’ approach requires some preliminary reasoning. 

We define the operator for the total angular momentum as j = 1 + s. 
The operator j 2 commutes with the interaction, as does its component 
j z : [s • 1, j 2 ] = 0 and [s • 1, j z \ = 0. Thus j and mj are good quantum 
numbers. 21 Hence suitable eigenstates for calculating the expectation 
value of s • 1 are | Isjmj). Mathematically these new eigenfunctions can 
be expressed as combinations of the old basis set: 
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I Isjmj) = E C(lsjrrij ; m^m s ) \l mi s m s ) . 

mi ,ui s 



Each eigenfunction labelled by /, s, j and ray is a linear combination of 
the eigenfunctions with the same values of l and s but various values 
of mi and m s . The coefficients C are the Clebsch-Gordan coefficients 
and their values for many possible combinations of angular momenta 
are tabulated in more advanced books. Particular values of Clebsch- 
Gordan coefficients are not needed for the problems in this book but 
it is important to know that, in principle, one set of functions can be 
expressed in terms of another complete set — with the same number of 
eigenfunctions in each basis. 

Finally, we use the identity 22 j 2 = l 2 + s 2 + 2s • 1 to express the 
expectation value of the spin-orbit interaction as 

(Isjmj | s • 1 1 Isjmj) = \ (Isjmj | j 2 — l 2 — s 2 | Isjmj) 

= \ {j{j + 1) — Z (Z + 1) — s (s + 1)} . 



22 

This applies both for vector opera- 
tors, where j 2 = j x 2 + j y 2 + j z 2 , and 
for classical vectors where this is simply 
J 2 = IJI 2 - 



The states | Isjmj) are eigenstates of the operators j 2 , l 2 and s 2 . The 
importance of the proper quantum treatment may not yet be apparent 
since all we appear to have gained over the vector model is being able to 
write the wavefunctions symbolically as | Isjmj). We will, however, need 
the proper quantum treatment when we consider further interactions 
that perturb these wavefunctions. 
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The fine structure in the alkalis is well approximated by an empirical 
modification of eqn 2.56 called the Lande formula: 



E 2 E 2 

AEps = q 2 — a 2 hcRt 

(n*) 3 1(1 + 1) 



(4.13) 



In the denominator the effective principal quantum number cubed (n*) 3 
(defined in Section 4.2) replaces n 3 . The effective atomic number Z e ff, 
which was defined in the discussion of the central-field approximation, 
tends to the inner atomic number Z\ ~ Z as r — > 0 (where the electron 
‘sees’ most of the nuclear charge); outside the core the field corresponds 
to an outer atomic number Z Q ~ 1 (for neutral atoms). The Lande 
formula can be justified by seeing how the central- field approximation 
modifies the calculation of the fine structure in hydrogen (Section 2.3.2). 
The spin-orbit interaction depends on the electric field that the electron 
moves through; in an alkali metal atom this field is proportional to 
Z eft if )r/r 3 rather than r/r 3 as in hydrogen. 23 Thus the expectation 
value of the spin-orbit interaction depends on 

^eff(r) ^ _ ^ 1 dV CF (r) \ 



23 

This modification is equivalent to us- 
ing Vcf in place of the hydrogenic po- 
tential proportional to 1/r. 



er dr 
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Fig. 4.9 The fine-structure compo- 
nents of a p to s transition, e.g. the 
3 Si/ 2 - 3 P1/2 and 3 Si/ 2 _ 3 P3/2 transi- 
tions in sodium. (Not to scale.) The 
statistical weights of the upper levels 
lead to a 1:2 intensity ratio. 

24 

The rates of the allowed transitions 
depend on integrals involving the radial 
wavefunctions (carried out numerically 
for the alkalis) and the integrals over 
the angular part of the wavefunction 
given in Section 2.2.1, where we derived 
the selection rules. 

25 This shortened form of the full LS- 
coupling scheme notation gives all the 
necessary information for a single elec- 
tron, cf. 3s 2 Si / 2 — 3p 2 P 3/ / 2 - 

26 

This must be true for the physical 
reason that the decay rate is the same 
whatever the spatial orientation of the 
atom, and similarly for the spin states. 
All the different angular states have the 
same radial integral, i.e. that between 
the 3p and 3s radial wavefunctions. 

27 

This normal situation for fine struc- 
ture may be modified slightly in a case 
like caesium where the large separa- 
tion of the components means that the 
frequency dependence of the lifetime 
(eqn 1.24) leads to differences, even 
though the matrix elements are similar. 



rather than (l/r 3 ) as in hydrogen (eqn 2.51). This results in fine struc- 
ture for the alkalis, given by the Lande formula, that scales as Z 2 — this 
lies between the dependence on Z 4 for hydrogenic ions (no screening) 
and the other extreme of no dependence on atomic number for complete 
screening. The effective principal quantum number 77 * is remarkably 
similar across the alkalis, as noted in Section 4.2. 

As a particular numerical example of the scaling, consider the fine 
structure of sodium (Z = 11) and of caesium (Z = 55). The 3p con- 
figuration of sodium has a fine-structure splitting of 1700 m -1 , so for 
a Z 2 -dependence the fine structure of the 6p configuration of caesium 
should be (using 77 * from Table 4.2) 

1.7 x 10 3 x ^YX^) x (fi) = 28.5 x lC^m- 1 . 

This estimate gives only half the actual value of 55.4 x 10 3 m -1 , but 
the prediction is much better than if we had used a Z 4 scaling. (A 
logarithmic plot of the energies of the gross and fine structure against 
atomic number is given in Fig. 5.7. This shows that the actual trend of 
the fine structure lies close to the Z 2 -dependence predicted.) 

The fine structure causes the familiar yellow line in sodium to be a 
doublet comprised of the two wavelengths A = 589.0 nm and 589.6 nm. 
This, and other doublets in the emission spectrum of sodium, can be 
resolved by a standard spectrograph. In caesium the transitions be- 
tween the lowest energy configurations (6s-6p) give spectral lines at 
A = 852 nm and 894 nm — this ‘fine structure’ is not very fine. 

4.6.1 Relative intensities of fine-structure 
transitions 

The transitions between the fine-structure levels of the alkalis obey the 
same selection rules as in hydrogen since the angular momentum func- 
tions are the same in both cases. It takes a considerable amount of cal- 
culation to find absolute values of the transition rates 24 but we can find 
the relative intensities of the transitions between different fine-structure 
levels from a simple physical argument. As an example we shall look 
at p to s transitions in sodium, as shown in Fig. 4.9. The 3 Si/ 2 -3Pi/ 2 
transition has half the intensity of the 3 Si/ 2 -3P3/ 2 transition. 25 This 
1:2 intensity ratio arises because the strength of each component is pro- 
portional to the statistical weight of the levels (2j + 1). This gives 2:4 
for j = 1/2 and 3/2. To explain this we first consider the situation 
without fine structure. For the 3p configuration the wavefunctions have 
the form R 3 P (r) \lmism s ) and the decay rate of these states (to 3s) is 
independent of the values of mi and m s 26 Linear combinations of the 
states R(r) \lmism s ) with different values of mi and m s (but the same 
values of 77-, / and 8, and hence the same lifetime) make up the eigen- 
states of the fine structure, \lsjmj). Therefore an alkali atom has the 
same lifetime for both values of j. 27 
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If each state has the same excitation rate, as in a gas discharge lamp 
for example, then all the states will have equal populations and the 
intensity of a given component of the line is proportional to the number 
of contributing mj states. Similarly, the fine structure of transitions 
from s to p configurations, e.g. 3P 3 / 2 -5Si/ 2 and 3Pi/ 2 -5Si/ 2 , have an 
intensity ratio of 2:1 — in this case the lower frequency component has 
twice the intensity of the higher component, i.e. the opposite of the p 
to s transition shown in Fig. 4.9 (and such information can be used to 
identify the lines in an observed spectrum). More generally, there is a 
sum rule for intensities: the sum of the intensities to, or from, a given 
level is proportional to its degeneracy; this can be used when both upper 
and lower configurations have fine structure (see Exercise 4.8). 

The discussion of the fine structure has shown that spin leads to a 
splitting of energy levels of a given n, of which l levels have different 
j. These fine-structure levels are degenerate with respect to mj, but an 
external magnetic field removes this degeneracy. The calculation of the 
effect of an external magnetic field in Chapter 1 was a classical treat- 
ment that led to the normal Zeeman effect. This does not accurately 
describe what happens for atoms with one valence electron because the 
contribution of the spin magnetic moment leads to an anomalous Zee- 
man effect. The splitting of the fine-structure level into 2 j + 1 states 
(or Zeeman sub-levels) in an applied field is shown in Fig. 4.10. It is 
straightforward to calculate the Zeeman energy for an atom with a single 
valence electron, as shown in quantum texts, but to avoid repetition the 
standard treatment is not given here; in the next chapter we shall derive 
a general formula for the Zeeman effect on atoms with any number of 
valence electrons that covers the single-electron case (see Exercise 5.13). 
We also look at the Zeeman effect on hyperfine structure in Chapter 6. 




Fig. 4.10 In an applied magnetic field 
of magnitude B the four states of differ- 
ent mj of the 2 P 3 / 2 level have energies 
of ^Zeeman = gjH^Brrij — the factor gj 
arises from the projection of the contri- 
butions to the magnetic moment from 
1 and s onto j (see Exercise 5.13). 



Further reading 

This chapter has concentrated on the alkalis and mentioned the neigh- 
bouring inert gases; a more general discussion of the periodic table is 
given in Physical chemistry by Atkins (1994). 

The self-consistent calculations of atomic wavefunctions are discussed 
in Hartree (1957), Slater (1960), Cowan (1981), in addition to the text- 
book by Bransden and Joachain (2003). 

The numerical solution of the Schrodinger equation for the bound 
states of a central field in Exercise 4.10 is discussed in French and Taylor 
(1978), Eisberg and Resnick (1985) and Rioux (1991). Such numerical 
methods can also be applied to particles with positive energies in the 
potential to model scattering in quantum mechanics, as described in 
Greenhow (1990). The numerical method described in this book has 
deliberately been kept simple to allow quick implementation, but the 
Numerov method is more precise for this type of problem. 
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Exercises 

(4.1) Configuration of the electrons in francium 
Write down the full electronic configuration of 
francium (atomic number Z — 87). This element 
comes below caesium in the periodic table. 

(4.2) Finding the series limit for sodium 

Eight ultraviolet absorption lines in sodium have 
wavenumbers of 

38 541, 39 299, 39 795, 40137, 

40 383, 40 566, 40 706, 40 814, 

in units of cm -1 . Devise an extrapolation proce- 
dure to find the ionization limit of sodium with a 
precision justified by the data. Convert the result 
into electron volts. (You may find a spreadsheet 
program useful for manipulating the numbers.) 
What is the effective principal quantum number 
n* of the valence electron in the ground configu- 
ration? 

(4.3) Quantum defects of sodium 

The binding energies of the 3s, 4s, 5s and 6s con- 
figurations in sodium are 5.14 eV, 1.92 eV, 1.01 eV 
and 0.63 eV, respectively. Calculate the quantum 
defects for these configurations and comment on 
what you find. 

Estimate the binding energy of the 8s configura- 
tion and make a comparison with the n — 8 shell 
in hydrogen. 

(4.4) Quantum defect 

Estimate the wavelength of laser radiation that ex- 
cites the 5s 2 Si/2~7s 2 Si /2 transition in rubidium 
by simultaneous absorption of two photons with 
the same frequency (IE(Rb) = 4.17eV). (Two- 
photon spectroscopy is described in Section 8.4 but 
specific details are not required here.) 

(4.5) Application of quantum defects to helium and 
helium-like ions 



Configuration 


Binding energy (cm 1 ) 


ls2s 


35 250 


ls2p 


28 206 


ls3s 


14 266 


ls3p 


12 430 


ls3d 


12 214 



(a) Calculate the wavelength of the Is2p-ls3d line 
in helium and compare it with the Balmer-a 
line in hydrogen. 

(b) Calculate the quantum defects for the config- 
urations of helium in the table. Estimate the 
binding energies of the ls4 1 configurations. 

(c) The levels belonging to the ls4f configuration 
of the Li + ion all lie at an energy of 72.24 eV 
above the ion’s ground state. Estimate the 
second ionization energy of this ion. Answer: 
75.64 eV. 

(4.6) Quantum defects and fine structure of potassium 

An atomic vapour of potassium absorbs light at 
the wavelengths (in nm): 769.9, 766.5, 404.7, 

404.4, 344.7 and 344.6. These correspond to the 
transitions from the ground configuration 4s. Ex- 
plain these observations as fully as you can and 
estimate the mean wavelength of the next doublet 
in the series, and its splitting. (Potassium has 
IE = 4.34 eV.) 28 

(4.7) The Z -scaling of fine structure 

Calculate the fine-structure splitting of the 3p con- 
figuration of the hydrogen- like ion Na +1 ° (in eV). 
Explain why it is larger than the fine structure 
of the same configuration in the neutral sodium 
(0.002 eV) and hydrogen (1.3 x 10 _5 eV). 

(4.8) Relative intensities of fine- structure components 

(a) An emission line in the spectrum of an al- 
kali has three fine-structure components cor- 
responding to the transitions 2 P 3 /2- 2 D3/2, 
2 P 3 / 2 - 2 D 5/2 and 2 Pi/ 2 - 2 D 3/ / 2 . These compo- 
nents have intensities a, b and c, respectively, 
that are in the ratio 1:9:5. Show that these 
satisfy the rule that the sum of the intensities 
of the transitions to, or from, a given level is 
proportional to its statistical weight (2 J + 1). 

(b) Sketch an energy-level diagram of the fine- 
structure levels of the two terms nd 2 D and 
n' f 2 F (for n! > n). Mark the three allowed 
electric dipole transitions and find their rela- 
tive intensities. 



28 For a discussion of how to determine the quantum defect for a series of lines by an iterative method see Softley (1994). 
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(4.9) Spherical symmetry of a full sub-shell 

The sum s f2!m=-i l^,m| 2 i s spherically symmetric. 
Show this for the specific case of l = 1 and com- 
ment on the relevance of the general expression, 
that is true for all values of l, to the central-held 
approximation . 

(4.10) Numerical solution of the Schrodinger equation 

This exercise goes through a method of finding the 
wavefunctions and their energies for a potential (in 
the central-held approximation). This shows how 
numerical calculations are carried out in a simple 
case that can be implemented easily on a computer 
with readily available spreadsheet programs. 29 Of 
course, the properties of hydrogen-like atoms are 
well known and so the hrst stage really serves 
as a way of testing the numerical method (and 
checking that the formulae have been typed cor- 
rectly). It is straightforward to extend the nu- 
merical method to deal with other cases, e.g. the 
potentials in the central-held approximation illus- 
trated in Fig. 4. 7. 30 



(a) Derivation of the equations 

Show from eqn 2.4, and other equations in 
Chapter 2, that 

S + Is + ( g - i> w) i *w = °'< 414 > 

where the position and energy have been 
turned into dimensionless variables: x = r/ao 
and E is the energy in units of e 2 /87reoao — 
13.6 eV (equal to half the atomic unit of en- 
ergy used in some of the references). 31 In 
these units the effective potential is 



V(x) 



1(1 + 1 ) 



2 

5 

X 



(4.15) 



where l is the orbital angular momentum 
quantum number. 

The derivatives of a function f (x) can be ap- 
proximated by 



where S is a small step size. 32 
Show that the second derivative follows by ap- 
plying the procedure used to obtain the hrst 
derivative twice. Show also that substitution 
into eqn 4.14 gives the following expression for 
the value of the function at x + <5 in terms of 
its value at the two previous points: 



R(x + 5) = |2i?(a;) + (v(x) - E^j R(x)5 2 

'l + 6 -' 

X J 

(4.16) 

If we start the calculation near the origin then 



-(i- -)*(*-«) 



(25) = \{2+{v(5)-E) 

(35) = 1 (21? (25) + (v (2(5) -EjR (25) 5 2 

+ R(S)}- 



etc. Note that in the hrst equation the value of 
R (x) at x = 26 depends only on R ((5) — it can 
easily be seen why by inspection of eqn 4.16 
for the case of x = <5 (for this value of x the 
coefficient of R (0) is zero). Thus the calcula- 
tion starts at x — S and works outwards from 
there. 33 At all other positions (x > (5) the 
value of the function depends on its values at 
the two preceding points. From these recur- 
sion relations we can calculate the function at 
all subsequent points. 

The calculated functions will not be nor- 
malised and the starting conditions can be 
multiplied by an arbitrary constant without 
affecting the eigenenergies, as will become 
clear from looking at the results. In the fol- 
lowing R(S) = 1 is the suggested choice but 
any starting value works. 

(b) Implementation of the numerical method 
using a spreadsheet program 
Follow these instructions. 



d/ j f (x + (5/2) + f (x — (5/2) 
dx S 

d 2 / _ f (x + 6) + f (x - S) - 2f (x) 
dx 2 d 2 



1. Type the given text labels into cells Al, Bl, 
Cl, D2, E2 and F2 and the three numbers 
into cells Dl, El and FI so that it has the 
following form: 



29 With a spreadsheet it is very easy to make changes, e.g. to find out how different potentials affect the eigenenergies and 
wavefunctions. 

30 It is intended to put more details on the web site associated with this book, see introduction for the address. 

31 The electron mass m e = 1 in these units. Or, more strictly, its reduced mass. 

32 This abbreviation should not be confused with the quantum defect. 

33 This example is an exception to the general requirement that the solution of a second-order differential equation, such as 
that for a harmonic oscillator, requires a knowledge of the function at two points to define both the value of the function and 
its derivative. 



78 The alkalis 



A B 


c 


D 


E 


F 


1 x V (x) 


psi 


0.02 


-0.25 


1 


2 




step 


energy 


ang . mom . 



Column A will contain the x-coordinates, 
the potential will be in column B and the 
function in column C. Cells Dl, El and FI 
contain the step size, energy and orbital an- 
gular momentum quantum number (l = 1), 
respectively. 

2. Put 0 into A2 and the formula =A2+$D$1 
into A3. Copy cell A3 to the block 
A4:A1002. (Or start with a smaller num- 
ber of steps and adjust Dl accordingly.) 

3. The potential diverges at x = 0 so type 
inf . into B2 (or leave it blank, remember- 
ing not to refer to it). 

Put the formula 

=-2/A3 +$F$1*($F$1+1)/(A3*A3) 

into cell B3 (as in eqn 4.15). Copy B3 into 
the block B4:B1002. 

4. This is the crucial stage that calculates 
the function. Type the number 1 into cell 
C3. (We leave C2 blank since, as explained 
above, the value of the function at x — 0 
does not affect the solution given by the re- 
cursion relation in eqn 4.16.) Now move to 
cell C4 and enter the following formula for 
the recursion relation: 

=( 2*C3+(B3-$E$1)*C3*$D$1*$D$1 

- (1-$D$1/A3) *C2 )/ (1+$D$1/A3). 

Copy this into the block C5:C1002. Create 
an xy - plot of the wavefunction (with data 
points connected by smooth lines and no 
markers); the x series is A2:A1002 and the 
y series is C2:C1002. Insert this graph on 
the sheet. 

5. Now play around with the parameters and 
observe the effect on the wavefunction for 
a particular energy. 

(i) Show that the initial value of the func- 
tion does not affect its shape, or the 
eigenenergy, by putting 0 . 1 (or any 
number) into cell C3. 

(ii) Change the energy, e.g. put -0.251 
into cell El, then -0.249, and ob- 
serve the change in behaviour at large 



x. (The divergence is exponential, so 
even a small energy discrepancy gives 
a large effect.) Try the different ener- 
gies again with bigger and smaller step 
sizes in Dl. It is important to search 
for the eigenenergy using an appropri- 
ate range of x. The eigenenergy lies 
between the two values of the trial en- 
ergy that give opposite divergence, i.e. 
upwards and downwards on the graph. 

(iii) Change FI to 0 and find a solution for 
l = 0 . 

6. Produce a set of graphs labelled clearly 
with the trial energy that illustrate the 
principles of the numerical solution, for the 
two functions with n = 2 and two other 
cases. Compare the eigenenergies with the 
Bohr formula. 

Calculate the effective principal quantum 
number for each of the solutions, e.g. by 
putting =SQRT (-1/E1) in G1 (and the label 
n* in G2). 

(The search for eigenenergies can be auto- 
mated by exploiting the spreadsheet’s abil- 
ity to optimise parameters subject to con- 
straints (e.g. the ‘Goal Seek’ command, or 
similar) . Ask the program to make the last 
value of the function (in cell C1002) have 
the value of zero by adjusting the energy 
(cell El). This procedure can be recorded 
as a macro that searches for the eigenener- 
gies with a single button click.) 

7. Implement one, or more, of the following 
suggestions for improving the basic method 
described above. 

(i) Find the eigenenergies for a potential 
that tends to the Coulomb potential 
(— 2/x in dimensionless units) at long 
range, like those shown in Fig. 4.7, 
and show that the quantum defects for 
that potential depend on l but only 
weakly on n. 

(ii) For the potential shown in Fig. 4.7(c) 
compare the wavefunction in the inner 
and outer regions for several different 
energies. Give a qualitative explana- 
tion of the observed behaviour. 

(iii) Calculate the function P(r) = rR(r) 
by putting A3*C3 in cell D3 and copy- 
ing this to the rest of the column. 
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Make a plot of P(r), R(r) and V (r) for 
at least two different values of n and 
l. Adjust the value in C3, as in stage 
5(i), to scale the functions to conve- 
nient values for plotting on the same 
axes as the potential. 

(iv) Attempt a semi-quantitative calcula- 
tion of the quantum defects in the 
lithium atom, e.g. model Vcf (r) as in 
Fig. 4.7(a) for some reasonable choice 
of r 34 

'-'I / core- 

(v) Numerically calculate the sum of 
r 2 R 2 (r) 5 for all the values of the 
function and divide through by its 
square root to normalise the wave- 
function. With normalised functions 
(stored in a column of the spread- 



sheet) you can calculate the electric 
dipole matrix elements (and their ra- 
tios), e.g. |(3p| r 1 2s) | 2 / |(3p| r | Is) | 2 = 
36, as in Exercise 7.6 (not forgetting 
the to 3 factor from eqn 7.23). 

(vi) Assess the accuracy of this numerical 
method by calculating some eigenen- 
ergies using different step sizes. (More 
sophisticated methods of numerical in- 
tegration provided in mathematical 
software packages can be compared to 
the simple method, if desired, but the 
emphasis here is on the atomic physics 
rather than the computation. Note 
that methods that calculate higher 
derivatives of the function cannot cope 
with discontinuities in the potential.) 



Web site: 

http : //www. physics . ox.ac.uk/users/foot 

This site has answers to some of the exercises, corrections and other supplementary information. 



34 This simple model corresponds to all the inner electron charge being concentrated on a spherical shell. Making the tran- 
sition from the inner to outer regions smoother does not make much difference to the qualitative behaviour, as you can check 
with the program. 
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1 Choosing S(r) to account for all 
the repulsion between the spherically- 
symmetric core and the electrons out- 
side the closed shells, and also within 
the core, leaves the repulsion between 
the two valence electrons, i.e. H re ~ 
e 2 /47reori2. This approximation high- 
lights the similarity with helium (al- 
though the expectation value is eval- 
uated with different wavefunctions) . 
Although it simplifies the equations 
nicely, this is not the best approxi- 
mation for accurate calculations — S(r) 
can be chosen to include most of the 
direct integral (cf. Section 3.3.2). For 
alkali metal atoms, which we studied in 
the last chapter, the repulsion between 
electrons gives a spherically-symmetric 
potential, so that H re = 0. 

2 

For two p-electrons we cannot ignore 
mi as we did in the treatment of Is nl 
configurations in helium. Configura- 
tions with one, or more, s-electrons can 
be treated in the way already described 
for helium but with the radial wave- 
functions calculated numerically. 



The LS^-coupling scheme 



In this chapter we shall look at atoms with two valence electrons, e.g. al- 
kaline earth metals such as Mg and Ca. The structures of these elements 
have many similarities with helium, and we shall also use the central- 
field approximation that was introduced for the alkalis in the previous 
chapter. We start with the Hamiltonian for N electrons in eqn 4.2 and 
insert the expression for the central potential Vqf (r) (eqn 4.3) to give 



N 



i = 1 




+ V CF (t**) + 




2 /47re 0 




This Hamiltonian can be written as H = Hqf + H re , where the central- 
field Hamiltonian Hqf is that defined in eqn 4.4 and 



e(e^-»(,)} <m> 

i=l [j>i J 

is the residual electrostatic interaction. This represents that part of the re- 
pulsion not taken into account by the central field. One might think that 
the field left over is somehow non-central. This is not necessarily true. 
For configurations such as ls2s in He, or 3s4s in Mg, both electrons have 
spherically-symmetric distributions but a central field cannot completely 
account for the repulsion between them — a potential Vcf(f) does not in- 
clude the effect of the correlation of the electrons’ positions that leads 
to the exchange integral. 1 The residual electrostatic interaction perturbs 
the electronic configurations that are the eigenstates of the cen- 

tral field. These angular momentum eigenstates for the two electrons are 
products of their orbital and spin functions |ZimqSim Sl ) \l 2 r mi 2 S 2 r ms 2 ) 
and their energy does not depend on the atom’s orientation so that all 
the different mi states are degenerate, e.g. the configuration 3p4p has 
(2Zi + 1) ( 2/2 + 1) = 9 degenerate combinations of Yj 2?m2 . 2 Each 
of these spatial states has four spin functions associated with it, but 
we do not need to consider thirty-six degenerate states since the prob- 
lem separates into spatial and spin parts, as in helium. Nevertheless, 
the direct approach would require diagonalising matrices of larger di- 
mensions than the simple 2x2 matrix whose determinant was given in 
eqn 3.17. Therefore, instead of that brute- force approach, we use the 
‘look-before-you-leap’ method that starts by finding the eigenstates of 
the perturbation H re . In that representation, H re is a diagonal matrix 
with the eigenvalues as its diagonal elements. 
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The interaction between the electrons, from their electrostatic repul- 
sion, causes their orbital angular momenta to change, i.e. in the vector 
model li and I 2 change direction, but their magnitudes remain constant. 
This internal interaction does not change the total orbital angular mo- 
mentum L = li + 1 2 , so li and I 2 move (or precess) around this vector, as 
illustrated in Fig. 5.1. When no external torque acts on the atom, L has 
a fixed orientation in space so its z-component Ml is also a constant of 
the motion (raq and m/ 2 are not good quantum numbers). This classical 
picture of conservation of total angular momentum corresponds to the 
quantum mechanical result that the operators L 2 and L z both commute 
with H re : 3 

[L 2 , H re ] = 0 and [L z ,H le } = 0. (5.2) 

Since H re does not depend on spin it must also be true that 

[S 2 ,H re ] = 0 and [S z ,H te \= 0. (5.3) 

Actually, H re also commutes with the individual spins si and S 2 but 
we chose eigenfunctions of S to antisymmetrise the wavefunctions, as 
in helium — the spin eigenstates for two electrons are V^i n and ^ in for 
S = 0 and 1, respectively. 4 The quantum numbers L,Ml,S and Ms 
have well-defined values in this Russel l-Saunders or LS'-coupling scheme. 
Thus the eigenstates of H re are \LMlSMs). In the L^-coupling scheme 
the energy levels labelled by L and S are called terms (and there is 
degeneracy with respect to Ml and Ms). We saw examples of X L and 
3 L terms for the Is nl configurations in helium where the LS'-coupling 
scheme is a very good approximation. A more complex example is an 
npn'p configuration, e.g. 3p4p in silicon, that has six terms as follows: 

li = 1, ^2 = 1 => L = 0, 1 or 2 , 

5i = i, s 2 = ^ => S' = 0 or 1 ; 

terms: 2S+1 L = X S, 1 P, X D, 3 S, 3 P, 3 D . 

The direct and exchange integrals that determine the energies of these 
terms are complicated to evaluate (see Woodgate (1980) for details) 
and here we shall simply make some empirical observations based on 
the terms diagrams in Figs 5.2 and 5.3. The (2Zi + 1) (2 Z 2 + 1) = 9 
degenerate states of orbital angular momentum become the 1 + 3 + 
5 = 9 states of Ml associated with the S, P and D terms, respectively. 
As in helium, linear combinations of the four degenerate spin states 
lead to triplet and one singlet terms but, unlike helium, triplets do not 
necessarily he below singlets. Also, the 3p 2 configuration has fewer terms 
than the 3p4p configuration for equivalent electrons, because of the Pauli 
exclusion principle (see Exercise 5.6). 

In the special case of ground configurations of equivalent electrons the 
spin and orbital angular momentum of the lowest-energy term follow 
some empirical rules, called Hund’s rules: the lowest-energy term has 
the largest value of S consistent with the Pauli exclusion principle. 5 If 




Fig. 5.1 The residual electrostatic in- 
teraction causes li and I 2 to precess 
around their resultant L = li + I 2 . 

O 

The proof is straightforward for the 
quantum operator: L z = l\ z -\-l2z since 
mq = q always occurs with mj 2 = —q 
in eqn 3.30. 

4 The Hamiltonian H commutes with 
the exchange (or swap) operator Xij 
that interchanges the labels of the par- 
ticles i <-► j\ thus states that are simul- 
taneously eigenfunctions of both oper- 
ators exist. This is obviously true for 
the Hamiltonian of the helium atom in 
eqn 3.1 (which looks the same if 1 2), 

but it also holds for eqn 5.1. In general, 
swapping particles with the same mass 
and charge does not change the Hamil- 
tonian for the electrostatic interactions 
of a system. 



Two electrons cannot both have the 
same set of quantum numbers. 
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Fig. 5.2 The terms of the 3p4p configu- 
ration in silicon all lie about 6 eV above 
the ground state. The residual electro- 
static interaction leads to energy dif- 
ferences of ~ 0.2 eV between the terms, 
and the fine-structure splitting is an or- 
der of magnitude smaller, as indicated 
for the 3 P and 3 D terms. This structure 
is well described by the LS'-coupling 
scheme. 



Fig. 5.3 The energies of terms of the 
3p 2 configuration of silicon. For equiv- 
alent electrons the Pauli exclusion prin- 
ciple restricts the number of terms — 
there are only three compared to the 
six in Fig. 5.2. The lowest-energy term 
is 3 P, in accordance with Hund’s rules, 
and this is the ground state of silicon 
atoms. 

6 Hund’s rules are so commonly mis- 
applied that it is worth spelling out 
that they only apply to the lowest term 
of the ground configuration for cases 
where there is only one incomplete sub- 
shell. 

The large total spin has important 
consequences for magnetism (Blundell 
2001 ). 




1 . 8 - 

1 . 6 - 

1.4- 

1 . 2 - 

1 . 0 - 




Eqf 4 - E re -\-E s — o 



there are several such terms then the one with the largest L is lowest. 
The lowest term in Fig. 5.3 is consistent with these rules; 6 the rule says 
nothing about the ordering of the other terms (or about any of the 
terms in Fig. 5.2). Configurations of equivalent electrons are especially 
important since they occur in the ground configuration of elements in 
the periodic table, e.g. for the 3d 6 configuration in iron, Hund’s rules 
give the lowest term as 5 D (see Exercise 5. 6). 7 
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5.1 Fine structure in the XS'-coupling 
scheme 



Fine structure arises from the spin-orbit interaction for each of the un- 
paired electrons given by the Hamiltonian 

H s - 0 = /?iSi • ll + /3 2 S 2 • I2 • 



For atoms with two valence electrons H s - 0 acts as a perturbation on 
the states | LMlSMs)- In the vector model, this interaction between 
the spin and orbital angular momentum causes L and S to change di- 
rection, so that neither L z nor S z remains constant; but the total elec- 
tronic angular momentum J = L + S, and its ^-component J z , are both 
constant because no external torque acts on the atom. We shall now 
evaluate the effect of the perturbation H s _ 0 on a term using the vec- 
tor model. In the vector-model description of the L^-coupling scheme, 
li and I 2 precess around L, as shown in Fig. 5.4; the components per- 
pendicular to this fixed direction average to zero (over time) so that 
only the component of these vectors along L needs to be considered, 
e.g. li — ^ { (li • L) / |L| 2 }L. The time average li • L in the vector model 
becomes the expectation value (li • L) in quantum mechanics; also we 
have to use L(L + 1) for the magnitude-squared of the vector. Applying 
the same projection procedure to the spins leads to 
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(5.4) 



The derivation of this equation by the vector model that argues by 
analogy with classical vectors can be fully justified by reference to the 
theory of angular momentum. It can be shown that, in the basis | J Mj) 
of the eigenstates of a general angular momentum operator J and its 
component J 2 , the matrix elements of any vector operator V are pro- 
portional to those of J, i.e. ( J Mj\ V | J Mj) = c(J Mj\ J | J Mj). 8 Fig- 
ure 5.5 gives a pictorial representation of why it is only the component of 
V along J that is well defined. We want to apply this result to the case 
where V = li or 1 2 in the basis of eigenstates | LMl), and analogously 
for the spins. For ( L Mp \ li | L Ml) = c(L Ml \ L | L Ml) the constant 
c is determined by taking the dot product of both sides with L to give 

= (L M l \ li • L |L Ml) . 
c (LM l \L-L\LM l ) ’ 

hence 

(L M l \ li | L M l ) = (L M L | L | L M L ) . (5.5) 

This is an example of the projection theorem and can also be applied to 
1 2 and to Si and s 2 in the basis of eigenstates | S Ms). It is clear that, 
for diagonal matrix elements, these quantum mechanical results give the 
same result of the vector model. 




Fig. 5.4 In the LS'-coupling scheme 
the orbital angular momenta of the two 
electrons couple to give total angular 
momentum L = li + 1 2 . I n the vec- 
tor model li and 1 2 precess around L; 
similarly, si and s 2 precess around S. 
L and S precess around the total an- 
gular momentum J (but more slowly 
than the precession of li and 1 2 around 
L because the spin-orbit interaction is 
‘weaker’ than the residual electrostatic 
interaction). 



^ This is particular case of a more 
general result called the Wigner- 
Eckart theorem which is the corner- 
stone of the theory of angular momen- 
tum. This powerful theorem also ap- 
plies to off-diagonal elements such as 
and to more com- 
plicated operators such as those for 
quadrupole moments. It is used ex- 
tensively in advanced atomic physics — 
see the ‘Further reading’ section in this 
chapter. 




Fig. 5.5 A pictorial representation of 
the project theorem for an atom, where 
J defines the axis of the system. 
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^ Similarly, in the one-electron case we 
found the fine structure without deter- 
mining the eigenstates | Isjmj) explic- 
itly in terms of the Y^ m and spin func- 
tions. 
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Fig. 5.6 The fine structure of a 3 P 
term obeys the interval rule. 



10 In classical mechanics the word ‘cou- 
pling’ is commonly used more loosely, 
e.g. for coupled pendulums, or coupled 
oscillators, the ‘coupling between them’ 
is taken to mean the ‘interaction be- 
tween them’ that leads to their motions 
being coupled. (This coupling may take 
the form of a physical linkage such as a 
rod or spring between the two systems.) 



U E s - 0 ~ (3ls and E re is comparable 
to the exchange integral. 



Equation 5.4 has the same form as the spin-orbit interaction for the 
single-electron case but with capital letters rather than s-1. The constant 
Pls that gives the spin-orbit interaction for each term is related to that 
for the individual electrons (see Exercise 5.2). The energy shift is 

£s-o = p LS (s • L) . (5.6) 

To find this energy we need to evaluate the expectation value of the 
operator L • S = (J • J - L • L - S • S) /2 for each term 2S+1 L. Each 
term has (2 S + 1) (2 L + 1) degenerate states. Any linear combination 
of these states is also an eigenstate with the same electrostatic energy 
and we can use this freedom to choose suitable eigenstates and make the 
calculation of the (magnetic) spin-orbit interaction straightforward. We 
shall use the states | LSJMj); these are linear combinations of the basis 
states | LMlSMs) but we do not need to determine their exact form to 
find the eigenenergies. 9 Evaluation of eqn 5.6 with the states | LSJMj) 
gives 

Es- o = ^ {J (J +1) - L(L + 1) - S (S + 1)} . (5.7) 

Thus the energy interval between adjacent J levels is 

AE F s = Ej - Ej _ i = (5ls J • (5.8) 

This is called the interval rule. For example, a 3 P term {L = 1 = S) has 
three J levels: 2S+1 Lj = 3 P 0 , 3 Pi, 3 P 2 (see Fig. 5.6); and the separation 
between J = 2 and J — 1 is twice that between J = 1 and J = 0. The 
existence of an interval rule in the fine structure of a two-electron system 
generally indicates that the LS'-coupling scheme is a good approximation 
(see the ‘Exercises’ in this chapter); however, the converse is not true. 
The LS- coupling scheme gives a very accurate description of the energy 
levels of helium but the fine structure does not exhibit an interval rule 
(see Example 5.2 later in this chapter). 

It is important not to confuse LS'-coupling (or Russell-Saunders cou- 
pling) with the interaction between L and S given by /3ls S • L. In this 
book the word interaction is used for real physical effects described by 
a Hamiltonian and coupling refers to the forming of linear-combination 
wavefunctions that are eigenstates of angular momentum operators, e.g. 
eigenstates of L and S. The L^-coupling scheme breaks down as the 
strength of the interaction /3ls S • L increases relative to that of H re . 10 

5.2 The jj -coupling scheme 

To calculate the fine structure in the LS'-coupling scheme we treated the 
spin-orbit interaction as a perturbation on a term, 25+1 L. This is valid 
when E re E s _ 0 , which is generally true in light atoms. 11 The spin- 
orbit interaction increases with atomic number (eqn 4.13) so that it can 
be similar to E re for heavy atoms — see Fig. 5.7. However, it is only in 
cases with particularly small exchange integrals that E s _ 0 exceeds E re , 
so that the spin-orbit interaction must be considered before the residual 
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Energy (eV) 




Fig. 5.7 A plot of typical energies as a 
function of the atomic number Z (on 
logarithmic scales). A characteristic 
energy for the gross structure is taken 
as the energy required to excite an elec- 
tron from the ground state to the first 
excited state — this is less than the ion- 
ization energy but has a similar varia- 
tion with Z. The residual electrostatic 
interaction is the singlet-triplet separa- 
tion of the lowest excited configuration 
in some atoms with two valence elec- 
trons. The fine structure is the split- 
ting of the lowest p configuration. For 
all cases the plotted energies are fairly 
close to the maximum for that type 
of structure in neutral atoms — higher- 
lying configurations have smaller val- 
ues. 



electrostatic interaction. When H s - 0 acts directly on a configuration it 
causes the 1 and s of each individual electron to be coupled together to 
give ji = li + Si and $2 = h + S 2 ; in the vector model this corresponds 
to 1 and s precessing around j independently of the other electrons. 
In this jj-coupling scheme each valence electron acts on its own, as 
in alkali atoms. For an sp configuration the s-electron can only have 
j 1 = 1/2 and the p-electron has = 1/2 or 3/2; so there are two 
levels, denoted by (ji, j?) = (1/2, 1/2) and (1/2, 3/2). The residual 
electrostatic interaction acts as a perturbation on the jj-coupled levels; 
it causes the angular momenta of the electrons to be coupled to give 
total angular momentum J = ji + J 2 (as illustrated in Fig. 5.8). Since 
there is no external torque on the atom, Mj is also a good quantum 
number. For an sp configuration there are pairs of J levels for each of the 
two original jj-coupled levels, e.g. (ji, j 2 )j = (l/2,l/2) 0 , (1/2, 1/2) 1 
and (1/2, 3/2) 1 , (1/2, 3/2) 2 . This doublet structure, shown in Fig. 5.10, 
contrasts with the singlets and triplets in the LS'-coupling scheme. 
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Fig. 5.8 The jj-coupling scheme. The 
spin-orbit interaction energy is large 
compared to the E re . (Cf. Fig. 5.4 for 
the LS'-coupling scheme.) 

12 

In both of these cases we assume an 
isolated configuration, i.e. that the en- 
ergy separation of the different config- 
urations in the central field is greater 
than the perturbation produced by ei- 
ther E s - o or E re . 




In summary, the conditions for LS- and jj-coupling are as follows: 12 

L6'-coupling scheme: E re >> E s _ 0 , 

jj-coupling scheme: E 8 - Q E re . 



5.3 Intermediate coupling: the transition 
between coupling schemes 

In this section we shall look at examples of angular momentum coupling 
schemes in two-electron systems. Figure 5.9 shows energy- level diagrams 
of Mg and Hg and the following example looks at the structure of these 
atoms. 

Example 5.1 



3s3p, Mg 


6s6p, Hg 


2.1850 


3.76 


2.1870 


3.94 


2.1911 


4.40 


3.5051 


5.40 



The table gives the energy levels, in units of 10 6 m -1 measured from 
the ground state, for the 3s3p configuration in magnesium (Z = 12) and 
6s6p in mercury (Z = 80). We shall use these data to identify the levels 
and assign further quantum numbers. 

For an sp configuration we expect X P and 3 P terms. In the case of 
magnesium we see that the spacings between the three lowest levels are 
2000 m -1 and 4100 m -1 ; these are close to the 1 to 2 ratio expected from 
the interval rule for the levels with J = 0, 1 and 2 that arise from the 
triplet. The L5'-coupling scheme gives an accurate description because 
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H He Mg Hg Complex 




Fig. 5.9 The terms of helium, magnesium and mercury are plotted on the same energy scale (with hydrogen on the left for 
comparison). The fine structure of the lighter atoms is too small to be seen on this scale and the LS'-coupling scheme gives 
a very accurate description. This scheme gives an approximate description for the low-lying terms of mercury even though it 
has a much larger fine structure, e.g. for the 6s6p configuration the E re > E s - Q but the interval rule is not obeyed because 
the spin-orbit interaction is not very small compared to the residual electrostatic interaction. The Is 2 configuration of helium 
is not shown; it has a binding energy of — 24.6eV (see Fig. 3.4). The ls2s and ls2p configurations of helium lie close to the 
n = 2 shell in hydrogen, and similarly the ls3l configurations lie close to the n = 3 shell. In magnesium, the terms of the 
3snf configurations have very similar energies to those in hydrogen, but the differences get larger as l decreases. The energies 
of the terms in mercury have large differences from the hydrogen energy levels. Much can be learnt by carefully studying this 
term diagram, e.g. there is a 1 P term which has similar energy in the three configurations: ls2p, 3s3p and 6s6p in He, Mg and 
Hg, respectively — thus the effective quantum number n* is similar despite the increase in n. Complex terms arise when both 
valence electrons are excited in Mg, e.g. the 3p 2 configuration, and the 5d 9 6s 2 6p configuration in Hg. 
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13 

This identification of the levels is 
supported by other information, e.g. 
determination of J from the Zeeman ef- 
fect and the theoretically predicted be- 
haviour of an sp configuration shown in 
Fig. 5.10. 



the fine structure is much smaller than the energy separation (E re ~ 
1.3 x 10 6 m -1 ) between the 3 P term at ~ 2.2 x 10 6 m -1 and the 1 Pi 
level at 3.5 x 10 6 m -1 . In mercury the spacings of the levels, going 
down the table, are 0.18, 0.46 and 1.0 (in units of 10 6 m -1 ); these 
levels are not so clearly separated into a singlet and triplet. Taking the 
lowest three levels as 3 Po, 3 Pi and 3 P 2 we see that the interval rule is 
not well obeyed since 0.46/0.18 = 2.6 (not 2). 13 This deviation from 
the LS'-coupling scheme is hardly surprising since this configuration has 
a spin-orbit interaction only slightly smaller than the singlet-triplet 
separation. However, even for this heavy atom, the LS'-coupling scheme 
gives a closer approximation than the jj-coupling scheme. 




Fig. 5.10 A theoretical plot of the energy levels that arise from an sp configuration as a function of the strength of the 
spin-orbit interaction parameter (3 (of the p-electron defined in eqn 2.55). For (3 = 0 the two terms, 3 P and X P, have an energy 
separation equal to twice the exchange integral; this residual electrostatic energy is assumed to be constant and only (3 varies in 
the plot. As (3 increases the fine structure of the triplet becomes observable. As (3 increases further the spin-orbit and residual 
electrostatic interactions become comparable and the LS'-coupling scheme ceases to be a good approximation: the interval rule 
and (LS'-coupling) selection rules break down (as in mercury, see Fig. 5.9). At large (3 the jj-coupling scheme is appropriate. 
The operator J commutes with H s - 0 (and H re ); therefore H s - 0 only mixes levels of the same J, e.g. the two J = 1 levels in 
this case. (The energies of the J = 0 and 2 levels are straight lines because their wavefunctions do not change.) Exercise 5.8 
gives an example of this transition between the two coupling schemes for np(n + l)s configurations with n = 3 to 5 (that have 
small exchange integrals). 
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Example 5.2 The ls2p configuration in helium 



J 


E (m -1 ) 


2 


16 908 687 


1 


16 908 694 


0 


16 908 793 


1 


17113 500 



The table gives the values of J and the energy, in units of m -1 measured 
from the ground state, for the levels of the ls2p configuration in helium. 
The 3 P term has a fine-structure splitting of about 100m _1 that is 
much smaller than the singlet-triplet separation of 10 6 m -1 from the 
electrostatic interaction (twice the exchange integral). Thus the LS- 
coupling scheme gives an excellent description of the helium atom and 
the selection rules in Table 5.1 are well obeyed. But the interval rule is 
not obeyed — the intervals between the J levels are 7 m -1 and 99 m -1 and 
the fine structure is inverted. This occurs in helium because spin-spin 
and spin-other-orbit interactions have an energy comparable with that of 
the spin-orbit interaction. 14 However, for atoms other than helium, the 
rapid increase in the strength of the spin-orbit interaction with Z ensures 
that H s - 0 dominates over the others. Therefore the fine structure of 
atoms in the I/5'-coupling scheme usually leads to an interval rule. 



14 The spin-spin interaction arises from 
the interaction between two magnetic 
dipoles (independent of any relative 
motion). See eqn 6.12 and its expla- 
nation. 



Further examples of energy levels are given in the exercises at the end 
of this chapter. Figure 5.10 shows a theoretical plot of the transition 
from the LS- to the jj-coupling scheme for an sp configuration. Conser- 
vation of the total angular momentum means that J is a good quantum 
number even in the intermediate coupling regime and can always be used 
to label the levels. The notation 25+1 Lj for the L^-coupling scheme is 
often used even for systems in the intermediate regime and also for one- 
electron systems, e.g. Is 2 Si/ 2 for the ground state of hydrogen. 



Table 5.1 Selection rules for electric dipole (El) transitions in the LS'-coupling 
scheme. Rules 1-4 apply to all electric dipole transitions; rules 5 and 6 are obeyed 
only when L and S are good quantum numbers. The right-hand column gives the 
structure to which the rule applies. 



1 


AJ = 0, ±1 


(J = 0«J' = 0) 


Level 


2 


A Mj = 0, ±1 


(. Mj = 0« Mj, = 0 if A J = 0) 


State 


3 


Parity changes 




Configuration 


4 


A l = ±1 


One electron jump 


Configuration 


5 


AL = 0, ±1 


(L = 0 ^ L' = 0) 


Term 


6 


AS = 0 




Term 
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15 

There is no simple physical ex- 
planation of why an Mj = 0 to 

Mj/ = 0 transition does not occur 
if J = J'\ it is related to the sym- 
metry of the dipole matrix element 
(yj Mj = 0\r\Y J Mj = 0), where 7 
and 7 ' represent the other quantum 
numbers. The particular case of J = 
J' = 1 and AMj = 0 is discussed in 
Budker et al. (2003). 



15 This line comes from the second level 
in the table given in Example 5.1, since 
0.254 |Um= 1/(3.941 x 10 6 m- 1 ). 



17 

Intercombination lines are not ob- 
served in magnesium and helium. The 
relative strength of the intercombina- 
tion lines and allowed transitions are 
tabulated in Kuhn (1969). 



5.4 Selection rules in the TS'-coupling 
scheme 

Table 5.1 gives the selection rules for electric dipole transitions in the 
L6'-coupling scheme (listed approximately in the order of their strict- 
ness). The rule for J reflects the conservation of this quantity and is 
strictly obeyed; it incorporates the rule for A j in eqn 2.59, but with 
the additional restriction J = 0 J' = 0 that affects the levels with 
J = 0 that occur in atoms with more than one valence electron. The 
rule for A Mj follows from that for A J: the emission, or absorption, of 
a photon cannot change the component along the z-axis by more than 
the change in the total atomic angular momentum. (This rule is rele- 
vant when the states are resolved, as in the Zeeman effect described in 
the following section.) 15 The requirement for an overall change in parity 
and the selection rule for orbital angular momentum were discussed in 
Section 2.2. In a configuration n 1 / 1 77 , 2/2 77 , 3/3 * * 'n x l x only one electron 
changes its value of / (and may also change n). The rule for A L al- 
lows transitions such as 3p4s 3 Pi-3p4p 3 Pi. The selection rule AS = 0 
arises because the electric dipole operator does not act on spin, as noted 
in Chapter 3 on helium; as a consequence, singlets and triplets form 
two unconnected sets of energy levels, as shown in Fig. 3.5. Similarly, 
the singlet and triplet terms of magnesium shown in Fig. 5.9 could be 
rearranged. In the mercury atom, however, transitions with AS = 1 
occur, such as 6s 2 1 So-6s6p 3 Pi, that gives a so-called intercombination 
line with a wavelength of 254nm. 16 This arises because this heavy atom 
is not accurately described by the LS'-coupling scheme and the spin- 
orbit interaction mixes some x Pi wavefunction into the wavefunction 
for the term that has been labelled 3 Pi (this being its major compo- 
nent). Although not completely forbidden, the rate of this transition is 
considerably less than it would be for a fully-allowed transition at the 
same wavelength; however, the intercombination line from a mercury 
lamp is strong because many of the atoms excited to triplet terms will 
decay back to the ground state via this transition (see Fig. 5. 9). 17 
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18 

Most quantum mechanics texts de- 
scribe the anomalous Zeeman effect for 
a single valence electron that applies to 
the alkalis and hydrogen. 



The Zeeman effect for atoms with a single valence electron was not 
presented in earlier chapters to avoid repetition and that case is covered 
by the general expression derived here for the LS'-coupling scheme. 18 
The atom’s magnetic moment has orbital and spin contributions (see 
Blundell 2001, Chapter 2): 

M = -/i b L - g s TBS • (5.9) 



The interaction of the atom with an external magnetic field is described 
by Hze = — B. The expectation value of this Hamiltonian can be 
calculated in the basis | LSJMj), provided that Eze E s - q <C E re , 
i.e. the interaction can be treated as a perturbation to the fine-structure 
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levels of the terms in the L^-coupling scheme. In the vector model we 
project the magnetic moment onto J (see Fig. 5.11) following the same 
rules as are used in treating fine structure in the L6'-coupling scheme 
(and taking B = Be z ). This gives 



H/e = - 






J(J + 1) 



J B = 



J(J+1) 



-/ilB&Jz 



In the vector model the quantities in angled brackets are time averages. 19 
In a quantum description treatment the quantities (• • • ) are expectation 
values of the form (J Mj \ ■ • • | J Mj) 20 In the vector model 



Eze = gjHE>BMj , (5-11) 



where the Lande g- factor is gj = {(L • J) + g s (S • J)} / { J (J + 1)} . As- 
suming that g s — 2 (see Section 2.3.4) gives 



3 S(S+1)-L(L+1) 

2 + 2 J ( J + 1) 



(5.12) 



Singlet terms have S = 0 so J = L and gj = 1 (no projection is nec- 
essary). Thus singlets all have the same Zeeman splitting between Mj 
states and transitions between singlet terms exhibit the normal Zeeman 
effect (shown in Fig. 5.12). The A Mj = ±1 transitions have frequencies 
shifted by ±i±b>B fin with respect to the A Mj = 0 transitions. 

In atoms with two valence electrons the transitions between triplet 
terms exhibit the anomalous Zeeman effect. The observed pattern de- 
pends on the values of gj and J for the upper and lower levels, as shown 
in Fig. 5.13. In both the normal and anomalous effects the 7r-transitions 
(A Mj = 0) and cr-transitions (A Mj = ±1) have the same polarizations 
as in the classical model in Section 1.8. Other examples in Exercises 5.10 
to 5.12 show how observation of the Zeeman pattern gives information 
about the angular momentum coupling in the atom. (The Zeeman ef- 
fect observed for the 2 P l/2 -2 Sl/2 and 2 P 3 / 2 — 2 ^i /2 transitions that arise 
between the fine-structure components of the alkalis and hydrogen is 
treated in Exercise 5.13.) Exercise 5.14 goes through the Paschen-Back 
effect that occurs in a strong external magnetic field — see Fig. 5.14. 



B 




Fig. 5.11 The projection of the 
contributions to the total magnetic 
moment from the orbital motion and 
the spin are projected along J. 

19 Components perpendicular to J 
time-average to zero. 

20 

This statement is justified by the pro- 
jection theorem (Section 5.1), derived 
from the more general Wigner-Eckart 
theorem. The theorem shows that the 
expectation value of the vector opera- 
tor L is proportional to that of J in the 
basis of eigenstates | J Mj), i.e. 

(JMj\L\JMj) oc (J Mj\ J | J Mj ) , 

and similarly for the expectation value 
of S. The component along the mag- 
netic field is found by taking the dot 
product with B: 

{J Mj \ L • B | J Mj) 

oc (JMj\ J -B\JMj) 
oc (JMj\J z \JMj) = Mj. 
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Fig. 5.12 The normal Zeeman effect 
on the ls2p 1 Pi-ls3d 1 T>2 line in he- 
lium. These levels split into three and 
five Mj states, respectively. Both lev- 
els have S = 0 and gj = 1 so that the 
allowed transitions between the states 
give the same pattern of three compo- 
nents as the classical model (in Sec- 
tion 1.8) — this is the historical reason 
why it is called the normal Zeeman ef- 
fect. Spectroscopists called any other 
pattern an anomalous Zeeman effect, 
although such patterns have a straight- 
forward explanation in quantum me- 
chanics and arise whenever S ^ 0, 
e.g. all atoms with one valence elec- 
tron have S = 1/2. The 7r- and cr- 
components arise from A Mj = 0 and 
A Mj = ±1 transitions, respectively. 
(In this example of the normal Zeeman 
effect each component corresponds to 
three allowed electric dipole transitions 
with the same frequency but they are 
drawn with a slight horizontal separa- 
tion for clarity.) 
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Fig. 5.13 The anomalous Zeeman ef- 
fect for the 6s6p 3 P 2 -6s7s 3 Si transi- 
tion in Hg. The lower and upper levels 
both have the same number of Zeeman 
sub-levels (or Mj states) as the levels in 
Fig. 5.12, but give rise to nine separate 
components because the levels have dif- 
ferent values of gj. (The 6s7s config- 
uration happens to have higher energy 
than 6s6p, as shown in Fig. 5.9, but the 
Zeeman pattern does not depend on the 
relative energy of the levels.) 
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Fig. 5.14 The Paschen-Back effect oc- 
curs in a strong external magnetic field. 
The spin and orbital angular momen- 
tum precess independently about the 
magnetic field. The states have ener- 
gies given by E PB = fi B B(M L +2M S )- 



5.6 Summary 

Figure 5.15 shows the different layers of structure in a case where the 
LS'-coupling scheme is a good approximation, i.e. the residual electro- 
static interaction dominates the two magnetic interactions (spin-orbit 
and with an external magnetic field). The spin-orbit interaction splits 
terms into different J levels. The Zeeman effect of a weak magnetic 
field splits the levels into states of given Mj, that are also referred to as 
Zeeman sub-levels. 

There are various ways in which this simple picture can break down, 
(a) Configuration mixing occurs if the residual electrostatic interaction 



3s3p 

Configuration 



X P 



J = 1 




J = 2 



3p 



J = 1 



J = 0 





Terms 



Levels 



States 



Fig. 5.15 The hierarchy of atomic structure for the 3s3p configuration of an alkaline earth metal atom. 
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is not small compared to the energy gap between the configurat- 
ions — this is common in atoms with complex electronic structure. 

(b) The jj - coupling scheme is a better approximation than LS - coupling 
or Russell-Saunders coupling when the spin-orbit interaction is 
greater than the residual electrostatic interaction. 

(c) The Paschen-Back effect arises when the interaction with an exter- 
nal magnetic field is stronger than the spin-orbit interaction (with 
the internal field). This condition is difficult to achieve except for 
atoms with a low atomic number and hence small fine structure. 
Similar physics arises in the study of the Zeeman effect on hyperfine 
structure where the transition between the low-field and high-field 
regimes occurs at values of the magnetic field that are easily acces- 
sible in experiments (see Section 6.3). 



Further reading 

The mathematical methods that describe the way in which angular 
momenta couple together form the backbone of the theory of atomic 
structure. In this chapter the quantum mechanical operators have been 
treated by analogy with classical vectors (the vector model) and the 
Wigner-Eckart theorem was mentioned to justify the projection theo- 
rem. Graduate-level texts give a more comprehensive discussion of the 
quantum theory of angular momentum, e.g. Cowan (1981), Brink and 
Satchler (1993) and Sobelman (1996). 



Exercises 



(5.1) Description of the LS-coupling scheme 
Explain what is meant by the central-field approx- 
imation and show how it leads to the concept of 
electron configurations. Explain how perturba- 
tions arising from (a) the residual electrostatic in- 
teractions, and (b) the magnetic spin-orbit inter- 
actions, modify the structure of an isolated multi- 
electron configuration in the L AS-coupling limit. 

(5.2) Fine structure in the LS-coupling scheme 

Show from eqn 5.4 that the J levels of the 3 P term 
in the 3s4p configuration have a separation given 
by eqn 5.8 with /3ls — /34 P / 2 (where /34 P s • 1 is the 
spin-orbit interaction of the 4p-electron) . 

(5.3) The LS-coupling scheme and the interval rule in 
calcium 

Write down the ground configuration of calcium 
(Z — 20). The line at 610 nm in the spectrum 



of neutral calcium consists of three components 
at relative positions 0, 106 and 158 (in units of 
cm -1 ). Identify the terms and levels involved in 
these transitions. 

The spectrum also contains a multiplet of six lines 
with wavenumbers 5019, 5033, 5055, 5125, 5139 
and 5177 (in units of cm -1 ). Identify the terms 
and levels involved. Draw a diagram of the rel- 
evant energy levels and the transitions between 
them. What further experiment could be carried 
out to check the assignment of quantum numbers? 

(5.4) The LS-coupling scheme in zinc 

The ground configuration of zinc is 4s 2 . The 
seven lowest energy levels of zinc are 0, 32 311, 
32 501, 32 890, 46 745, 53 672 and 55 789 (in units 
of cm -1 ). Sketch an energy- level diagram that 
shows these levels with appropriate quantum num- 
bers. What evidence do these levels provide that 
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the LS - coupling scheme describes this atom. Show 
the electric dipole transitions that are allowed be- 
tween the levels. 

(5.5) The LS-coupling scheme 



3s3p, Mg 


3s3p, Fe 14+ 


2.1850 


23.386 


2.1870 


23.966 


2.1911 


25.378 


3.5051 


35.193 



The table gives the energy levels, in units of 
10 6 m -1 measured from the ground state, of the 
3s3p configuration in neutral magnesium (Z = 
12) and the magnesium-like ion Fe 14+ . Suggest, 
with reasons, further quantum numbers to iden- 
tify these levels. Calculate the ratio of the spin- 
orbit interaction energies in the 3s3p configura- 
tion of Mg and Fe 14+ , and explain your result. 
Discuss the occurrence in the solar spectrum of 
a strong line at 41.726 nm that originates from 
Fe 14+ . Would you expect a corresponding tran- 
sition in neutral Mg? 

(5.6) LS-coupling for configurations with equivalent 
electrons 

(a) List the values of the magnetic quantum num- 
bers raq, m Sl , mi 2 and m S2 for the two elec- 
trons in an np 2 configuration to show that fif- 
teen degenerate states exist within the central- 
field approximation. Write down the values of 
Ml = mq -\-mi 2 and Ms = m Sl + m S2 associ- 
ated with each of these states to show that the 
only possible terms in the LS - coupling scheme 
are 1 S, 3 P and X D. 

(b) The ls 2 2s 2 2p 2 configuration of doubly- ionized 
oxygen has levels at relative positions 0, 113, 
307, 20 271 and 43184 (in units of cm -1 ) 
above the ground state, and its spectrum con- 
tains weak emission lines at 19 964 cm -1 and 
20158 cm -1 . Identify the quantum numbers 
for each of the levels and discuss the extent to 
which the LS - coupling scheme describes this 
multiplet. 

(c) For six d-electrons, in the same sub-shell, 
write a list of the values of the m s and mi 
for the individual electrons corresponding to 
Ms = 2 and Ml — 2. Briefly discuss why this 
is the maximum value of Ms , and why Ml ^ 2 
for this particular value of Ms- (Hence from 



Hund’s rules the 5 D term has the lowest en- 
ergy.) Specify the lowest-energy term for each 
of the five configurations nd, nd 2 , nd 3 , nd 4 
and nd 5 . 

(5.7) Transition from LS- to jj -coupling 



J 


3p4s, Si 

Energy (10 6 m -1 


) j 


3p7s, Si 

Energy (10 6 m -1 ) 


0 


3.968 


0 


6.154 


1 


3.976 


1 


6.160 


2 


3.996 


2 


6.182 


1 


4.099 


1 


6.188 



The table gives J-values and energies (in units of 
10 6 m -1 measured from the ground state) of the 
levels in the 3p4s and 3p7s configurations of sili- 
con. Suggest further quantum numbers to identify 
the levels. 

Why do the two configurations have nearly the 
same value of Ej = 2 — Ej = 0 but quite different en- 
ergy separations between the two J — 1 states? 

(5.8) Angular-momentum coupling schemes 



4p5s, germanium 
J Energy (10 6 m -1 ) 


J 


5p6s, tin 
Energy 


0 


3.75 


0 


3.47 


1 


3.77 


1 


3.49 


2 


3.91 


2 


3.86 


1 


4.00 


1 


3.93 



The table gives the J-values and energies (in units 
of 10 6 m -1 measured from the ground state) of the 
levels in the configurations 4p5s in Ge and 5p6s in 
Sn. Data for the 3p4s configuration in Si are given 
in the previous exercise. How well does the LS- 
coupling scheme describe the energy levels of the 
np(n + l)s configurations with n — 3, 4 and 5? 
Give a physical reason for the observed trends in 
the energy levels. 

One of the J — 1 levels in Ge has a Lande ^-factor 
of gj — 1.06. Which level would you expect this 
to be and why? 

(5.9) Selection rules in the LS-coupling scheme 

State the selection rules that determine the config- 
urations, terms and levels that can be connected 
by an electric dipole transition in the LS'-coupling 
approximation. Explain which rules are rigorous, 
and which depend on the validity of the coupling 
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scheme. Give a physical justification for three of 
these rules. 

Which of the following are allowed for electric 
dipole transitions in the LS- coupling scheme: 



(a) 


ls2s 


3 Si-ls3d 


3 Di, 


(b) 


ls2p 


3 Pi-ls3d 


3 d 3 , 


(c) 


2s2p 


3 Pi-2p 2 


3 Pi, 


(d) 


CO 

to 


3 Pi-3p 2 3 


Pa, 


(e) 


3p 6 


1 S 0 -3p 5 3d 


^2? 



The transition 4d 9 5s 2 2 D 5 / 2 -4d 10 5p 2 P 3 / 2 satis- 
fies the selection rules for L, S and J but it ap- 
pears to involve two electrons jumping at the same 
time. This arises from configuration mixing — the 
residual electrostatic interaction may mix configu- 
rations. 21 The commutation relations in eqns 5.2 
and 5.3 imply that H re only mixes terms of the 
same L, S and J. Suggest a suitable configuration 
that gives rise to a 2 P 3 /2 level that could mix with 
the 4d 10 5p configuration to cause this transition. 

(5.10) The anomalous Zeeman effect 

What selection rule governs A Mj in electric dipole 
transitions? Verify that the 3 Si- 3 P 2 transition 
leads to the pattern of nine equally-spaced lines 
shown in Fig. 5.13 when viewed perpendicular to 
a weak magnetic field. Find the spacing for a mag- 
netic flux density of 1 T. 

(5.11) The anomalous Zeeman effect 

Draw an energy- level diagram for the states of 3 Si 
and 3 Pi levels in a weak magnetic held. Indi- 
cate the allowed electric dipole transitions between 
the Zeeman states. Draw the pattern of lines ob- 
served perpendicular to the held on a frequency 
scale (marked in units of fi^B/h). 

(5.12) The anomalous Zeeman effect 



I 2 



2 




3 | 2 



2 



3 | 2 | 2 (Frequency 



The above Zeeman pattern is observed for a spec- 
tral line that originates from one of the levels of a 
3 P term in the spectrum of a two-electron system; 
the numbers indicate the relative separations of 
the lines, observed perpendicular to the direction 
of the applied magnetic held. Identify L, S and J 
for the two levels in the transition. 22 

(5.13) The anomalous Zeeman effect in alkalis 

Note that atoms with one valence electron are not 
discussed explicitly in the text. 

(a) Give the value of gj for the one-electron levels 
2 Si/ 2 , 2 Pi /2 and 2 P3/2- 

(b) Show that the Zeeman pattern for the 
3s 2 S i/ 2 -3p 2 P 3/2 transition in sodium has 
six equally-spaced lines when viewed perpen- 
dicular to a weak magnetic held. Find the 
spacing (in GHz) for a magnetic hux density 
of 1 T. Sketch the Zeeman pattern observed 
along the magnetic held. 

(c) Sketch the Zeeman pattern observed per- 
pendicular to a weak magnetic held for the 
3s 2 S i/ 2 -3p 2 Pi /2 transition in sodium. 

(d) The two hne-structure components of the 3s- 
3p transition in sodium in parts (b) and (c) 
have wavelengths of 589.6 nm and 589.0 nm, 
respectively. What magnetic hux density pro- 
duces a Zeeman splitting comparable with the 
hne structure? 23 

(5.14) The Paschen-Back effect 

In a strong magnetic held L and S precess in- 
dependently about the held direction (as shown 
in Fig. 5.14), so that J and Mj are not good 
quantum numbers and appropriate eigenstates are 
| LMlSMs) ■ This is called the Paschen-Back ef- 
fect. In this regime the LS- coupling selection rules 
are A Ml = 0, ±1 and A Ms = 0 (because the elec- 
tric dipole operator does not act on the spin). 24 
Show that the Paschen-Back effect leads to a pat- 
tern of three lines with the same spacing as in the 
normal Zeeman effect (i.e. the same as if we com- 
pletely ignore spin). 25 



21 In the discussion of the LS'-coupling scheme we treated H re as a perturbation on a configuration and assumed that E re 
is small compared to the energy separation between the configurations in the central held. This is rarely true for high-lying 
configurations of complex atoms. 

22 The relative intensities of the components have not been indicated. 

23 This value is greater than 1 T so the assumption of a weak held in part (b) is valid. 

24 The rules for J and Mj are not relevant in this regime. 

25 The Paschen-Back effect occurs when the valence electrons interact more strongly with the external magnetic held than 
with the orbital held in H s —o • The L5-coupling scheme still describes this system, i.e. L and S are good quantum numbers. 
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6.1 Hyperfine structure 



Up to this point we have regarded the nucleus as an object of charge 
-\-Ze and mass Mn, but it has a magnetic moment /ij that is related to 
the nuclear spin I by 

Vi = 9i V nI- (6.1) 

Comparing this to the electron’s magnetic moment — g s [1b s we see that 
there is no minus sign. 1 Nuclei have much smaller magnetic moments 
than electrons; the nuclear magneton /in is related to the Bohr magneton 
/iB by the electron-to-proton mass ratio: 



m e fi B 

^ = 1836 



( 6 . 2 ) 



The interaction of fi : with the magnetic flux density created by the 
atomic electrons B e gives the Hamiltonian 



Hylfs = — AU ' B e • 



(6.3) 



This gives rise to hyperfine structure which, as its name suggests, is 
smaller than fine structure. Nevertheless, it is readily observable for 
isotopes that have a nuclear spin (/ ^ 0). 

The magnetic field at the nucleus is largest for s-electrons and we 
shall calculate this case first. For completeness the hyperfine structure 
for electrons with l ^ 0 is also briefly discussed, as well as other effects 
that can have similar magnitude. 
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1 Nuclear magnetic moments can be 
parallel, or anti-parallel, to I, i.e. gj 
can have either sign depending on the 
way that the spin and orbital angular 
momenta of the protons and neutrons 
couple together inside the nucleus. The 
proton (that forms the nucleus of a hy- 
drogen atom) has g p > 0 because of 
its positive charge. More generally, nu- 
clear magnetic moments can be pre- 
dicted from the shell model of the nu- 
cleus. 



6.1.1 Hyperfine structure for s-electrons 

We have previously considered the atomic electrons as having a charge 
distribution of density — e \ip (r) | 2 , e.g. in the interpretation of the direct 
integral in helium in eqn 3.15 (see also eqn 6.22). To calculate mag- 
netic interactions we need to consider an s-electron as a distribution of 
magnetisation given by 



M = -5s£<Bs|V’(r)| 2 • (6-4) 

This corresponds to the total magnetic moment of the electron —g s g bs 
spread out so that each volume element d 3 r has a fraction \vp (r)| 2 d 3 r 
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Fig. 6.1 (a) An illustration of a nu- 
cleus with spin I surrounded by the 
spherically-symmetric probability dis- 
tribution of an s-electron. (b) That 
part of the s-electron distribution in the 
region r < r^ corresponds to a sphere 
of magnetisation M anti-parallel to the 
spin s. 



2 See Blundell (2001) or electromag- 
netism texts. 



3 

This spherical boundary at r = r^ 
does not correspond to anything physi- 
cal in the atom but is chosen for math- 
ematical convenience. The radius r b 
should be greater than the radius of the 
nucleus tn and it easy to fulfil the con- 
ditions r*N <C r*b <C ao since typical nu- 
clei have a size of a few fermi (10 -15 m), 
which is five orders of magnitude less 
than atomic radii. 

4 The magnetisation is a function of 
r only and therefore each shell has a 
uniform magnetisation M(r) between r 
and r + dr. The proof that these shells 
do not produce a magnetic flux den- 
sity at r = 0 does not require M (r) 
to be the same for all the shells, and 
clearly this is not the case. Alterna- 
tively, this result can be obtained by in- 
tegrating the contributions to the field 
at the origin from the magnetic mo- 
ments M(r)d 3 r over all angles ( 9 and 
<«• 




of the total. For s-electrons this distribution is spherically symmetric 
and surrounds the nucleus, as illustrated in Fig. 6.1. To calculate the 
field at r = 0 we shall use the result from classical electromagnetism 2 
that inside a uniformly magnetised sphere the magnetic flux density is 

B e = ^oM. (6.5) 

However, we must be careful when applying this result since the distri- 
bution in eqn 6.4 is not uniform — it is a function of r. We consider the 
spherical distribution in two parts. 

(a) A sphere of radius r = r*b, where r b <C ao so that the electronic 
wavefunction squared has a constant value of |'0(O)| 2 throughout 
this inner region, as indicated on Fig. 6. 2. 3 From eqn 6.5 the field 
inside this uniformly magnetised sphere is 

B e = IVw (0)| 2 s. (6.6) 

(b) The part of the distribution outside the sphere r > produces no 
field at r = 0, as shown by the following argument. Equation 6.5 
for the field inside a sphere does not depend on the radius of that 
sphere — it gives the same field for a sphere of radius r and a sphere of 
radius r+dr. Therefore the contribution from each shell of thickness 
dr is zero. The region r > can be considered as being made up of 
many such shells that give no additional contribution to the field. 4 

Putting this field and ptj from eqn 6.1 into eqn 6.3 gives 

2 2 

-HhFS = gi/J ' N I • - /A) Qsl ^ B iV’ns (0) | S = A I • S . 



(6.7) 
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IVVs(r )| 2 




Pn (0 : 

A • 




j 


r 



Fig. 6.2 (a) The probability density (r ) | 2 of an s-electron at small distances (r <C 
no) is almost constant. The distribution of nuclear matter pn(t) gives an indication 
of the nuclear radius • To calculate the interaction of the nuclear magnetic moment 
with an s-electron the region is divided into two parts by a boundary surface of radius 
r = r b r N (as also shown in Fig. 6.1). The inner region corresponds to a sphere 
of uniform magnetisation that produces a flux density B e at r = 0. The nuclear 
magnetic moment interacts with this field. 



This is called the Fermi contact interaction since it depends on \f; ns (0)| 2 
being finite. It can also be expressed as 



Hufs = A I J (6.8) 

because J = s for / = 0. It is useful to write down this more general 
form at an early stage since it turns out that an interaction proportional 
to I • J is also obtained when 1^0. 

We have already considered the effect of an interaction proportional 
to a dot product of two angular momenta when looking at the spin-orbit 
interaction /3 S • L (eqn 5.4). In the same way the hyperfine interaction 
in eqn 6.8 causes I and J to change direction but the total angular 
momentum of the atom F = I + J remains constant. The quantities 
I • B and J • B are not constant in this precession of I and J around F. 
Therefore M/ and Mj are not good quantum numbers; we use F and 
Mp instead and the eigenstates of #hfs are | IJFMp)f The expectation 
value of eqn 6.8 gives 

£hfs = A (I • J) = j {F(F + 1) - /(/ + 1) - J(J + 1)} . (6.9) 

Example 6.1 Hyperfine structure of the Is 2 Si/ 2 ground state of hy- 
drogen 

The lowest level of the hydrogen atom is Is 2 Si/ 2 , i.e. J = \ and the 
proton has spin I = 1 /2 so that F = 0 and 1 and these hyperfine levels 
have energies 

£hfs = J {F(F + 1) - /(/ + 1) - J(J + 1)} = j^ /4 F f = l ; 

The splitting between the hyperfine levels is AEhfs = A (see Fig. 6.3). 
Substituting for | if ns (0)| 2 from eqn 2.22 gives 

2 Z 3 

A = -/io gsh * b gi^N — 3 — o • (6.10) 

3 7 t a^n 6 



5 This should be compared with the LS- 
coupling scheme where combinations 
of the eigenstates | LMj^SMs) form 
eigenstates of the spin-orbit interaction 
| LSJMj). The same warning issued for 
LS'-coupling and the spin-orbit inter- 
action /3S • L also applies. It is impor- 
tant not to confuse I J-coupling and the 
interaction AI • J. In the IJ - coupling 
scheme I and J are good quantum num- 
bers and the states are \IJMjMj) or 
| IJFMp). The latter are eigenstates 
of the interaction AI • J. 



F — 1 




Fig. 6.3 The splitting between the hy- 
perfine levels in the ground state of hy- 
drogen. 
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6 This hyperfine transition in hydrogen 
is the one detected in radio astronomy, 
where it is commonly referred to by its 
wavelength as the 21 cm line. 



7 . 

This works on the same basic princi- 
ple as the Stern-Gerlach experiment in 
Section 6.4. 



Fig. 6.4 The hydrogen maser. The 
principle of operation is described in 
the text. A magnetic shield excludes 
external fields and the solenoid cre- 
ates a small stable magnetic field. In 
this way the frequency shift produced 
by the Zeeman effect on the hyperfine 
structure is controlled (in the same way 
as in the atomic clock described in Sec- 
tion 6.4.2). 



The proton has a ^-factor of gi — 5.6, so for n = 1 = Z we have 



A£hfs 

h 



1.42 GHz. 



This is very close to the measured value of 1 420 405 751.7667±0.0009Hz. 
This frequency has been measured to such extremely high precision be- 
cause it forms the basis of the hydrogen maser, described below. 6 



6.1.2 Hydrogen maser 

Maser stands for microwave amplification by stimulated emission of radi- 
ation and such devices were the precursors of lasers, although nowadays 
they are less widespread than the devices using light. Actually, lasers are 
usually operated as oscillators rather than amplifiers but the acronym 
with ‘o’ instead of ‘a’ is not so good. A schematic of a hydrogen maser 
is shown in Fig. 6.4; it operates in the following way. 

• Molecular hydrogen is dissociated in an electrical discharge. 

• Atoms effuse from the source to form a beam in an evacuated chamber. 

• The atoms pass through a region with a strong magnetic field gradient 
(from a hexapole magnet) that focuses atoms in the upper hyperfine 
level (F = 1) into a glass bulb. The atomic beam contains atoms in 
both hyperfine levels but the state selection by the magnet 7 leads to a 



Atoms in states 
F = 1. M f = 0 
and 

F = 1. M f = 1 




Microwave 

output 
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population inversion in the bulb, i.e. the population in F = 1 exceeds 
that in F = 0; this gives more stimulated emission than absorption 
and hence gain, or amplification of radiation at the frequency of the 
transition. 

• The atoms bounce around inside the bulb — the walls have a ‘non-stick’ 
coating of teflon so that collisions do not change the hyperfine level. 

• The surrounding microwave cavity is tuned to the 1.42 GHz hyperfine 
frequency and maser action occurs when there are a sufficient number 
of atoms in the upper level. Power builds up in a microwave cavity, 
some of which is coupled out through a hole in the wall of the cavity. 

The maser frequency is very stable — much better than any quartz 
crystal used in watches. However, the output frequency is not precisely 
equal to the hyperfine frequency of the hydrogen atoms because of the 
collisions with the walls (see Section 6.4). 



6.1.3 Hyperfine structure for l ^ 0 



Electrons with l ^ 0, orbiting around the nucleus, give a magnetic field 



= Mo [ ~ev x (-r) 
e 47r \ r 3 



Me 



3 (Me • r)r 

r 3 



( 6 . 11 ) 



where — r gives the position of the nucleus with respect to the orbiting 
electron. The first term arises from the orbital motion. 8 It contains 
the cross-product of — ev with — r, the position vector of the nucleus 
relative to the electron, and — er x v = — 2/xb1- The second term is just 
the magnetic field produced by the spin dipole moment of the electron 
pi e = — 2/j, b s (taking g s = 2) at a position — r with respect to the dipole. 9 
Thus we can write 



B e 



2 Mo Mb 

4tt r 3 



{ 



1 — s + 



3 (s • r) r 



( 6 . 12 ) 



This combination of orbital and spin-dipolar fields has a complicated 
vector form. 10 However, we can again use the argument (in Section 5.1) 
that in the vector model there is rapid precession around J, and any 
components perpendicular to this quantisation axis average to zero, so 
that only components along J have a non-zero time-averaged value. 11 
The projection factor can be evaluated exactly (Woodgate 1980) but we 
shall assume that it is approximately unity giving 



(613) 

Thus from eqns 6.1 and 6.3 we find that the hyperfine interaction for 
electrons with l ^ 0 has the same form AI • J as eqn 6.8. This form of 
interaction leads to the following interval rule for hyperfine structure: 



Ep — Ep—i = A F . 



(6.14) 



This interval rule is derived in the same way as eqn 5.8 for fine structure 
but with /, J and F instead of L, S and J, as shown in Exercise 6.5. 



o 

This resembles the Biot-Savart law of 
electromagnetism: 

^ po I ds X r 
47T r 3 

Roughly speaking, the displacement 
along the direction of the current is re- 
lated to the electron’s velocity by ds = 
vdt, where d t is a small increment of 
time, and the current is related to the 
charge by I dt = Q. 

The spin-orbit interaction can be very 
crudely ‘justified’ in a similar way, by 
saying that the electron ‘sees’ the nu- 
cleus of charge +Ze moving round it; 
for a hydrogenic system this simplistic 
argument gives 



-■^orbital — . q ^ • 

4tt 



The Thomas precession factor does not 
occur in hyperfine structure because 
the frame of reference is not rotating. 

9 See Blundell (2001). 

^ (l The same two contributions to the 
field also occur in the fine structure of 
helium; the field produced at the po- 
sition of one electron by the orbital 
motion of the other electron is called 
the spin-other-orbit interaction, and a 
spin-spin interaction arises from the 
field produced by the magnetic dipole 
of one electron at the other electron. 

11 In quantum mechanics this corre- 
sponds to saying that the matrix 
elements of any vector operator in 
the eigenbasis | JMj) are propor- 
tional to J, i.e. (J Mj|B e \J Mj) oc 
(J Mj | J | J Mj). This is a consequence 
of the Wigner-Eckart theorem that was 
mentioned in Section 5.1. 
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Table 6.1 Comparison of fine and hyperfine structures. 





Fine structure in the 


Hyperfine structure in the 




LiS-coupling scheme 


IJ - coupling scheme 


Interaction 


/3 L • S 


A I J 


Total angular momentum 


J = L + S 


F = 1 + J 


Eigenstates 


\LSJMj) 


| IJFMf) 


Energy, E 


f { J (J + 1) - L {L + 1) - S (S + 1)} 


4{F(F+l)-/(/+l)-J(J+l)} 


Interval rule 


Ej - Ej- 1 = f3J 
(if E s - 0 < E le ) 


Ef — Ef—i — AF 

(if A AEQuadrupole) 



12 

This interval rule for magnetic dipole 
hyperfine structure can be disrupted by 
the quadrupole interaction. Some nu- 
clei are not spherical and their charge 
distribution has a quadrupole moment 
that interacts with the gradient of the 
electric field at the nucleus. This elec- 
tric quadrupole interaction turns out to 
have an energy comparable to the inter- 
action of the magnetic dipole moment 
yi with B e . Nuclei, and atoms, do not 
have static electric dipole moments (for 
states of definite parity). 



That exercise also shows how this rule can be used to deduce F and 
hence the nuclear spin I from a given hyperfine structure. 12 

The hyperfine-structure constant A(n,l,j) is smaller for l > 0 than 
for l = 0 and the same n. Exact calculation shows that the hyperfine- 
structure constants of the hydrogenic levels np 2 Pi /2 and ns 2 Si/ 2 are 
in the ratio 



Myi 2 P 1/2 ) l 

A{n 2 Si/ 2 ) 3' 



(6.15) 



This ratio is smaller in the alkalis, e.g. ~ 1/10 in the examples below, 
because the closed shells of electrons screen the nuclear charge more 
effectively for p-electrons than for s-electrons. 



6.1.4 Comparison of hyperfine and fine structures 

The analogy between hyperfine and fine structures is summarised in 
Table 6.1. 

For fine structure in the alkalis we found the Lande formula (eqn 4.13) 

y2y2 

.£7fs ^ — ■ — ^oPhcRoo . (6.16) 

(n*) 

The Z 4 scaling for a hydrogenic system is reduced to oc Z 2 for neu- 
tral atoms since the effective outer atomic number is Z 0 = 1 , and Z\ ~ Z 
gives a reasonable approximation in the inner region. Applying similar 
considerations to the hyperfine structure shows that the dependence on 
Z 3 in eqn 6.10 reduces to 



^hfs ~ 3 t y ® 1 h C R°° * (6-17) 

(n*) M p 

The mass ratio arises from /xn/^b = m e /M p . Hyperfine structure scales 
as Z, whereas fine structure scales as Z 2 ; thus Ehfs varies much less 
than Ey s, as the following comparison of the splittings for Na and Cs 
shows. 
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Na, Z = 11 


Cs, Z = 55 


£(3 P 2 P 3 /2)-£(3p 2 P 1/2 ), 


E(6 p 2 P 3/2 ) — E(6p 2 P 1/2 ), 


A/fs = 510 GHz 


A/fs = 16 600 GHz 


For the ground state 3s 2 Si/ 2 , 


For the ground state 6s 2 Si/ 2 , 


A/hfs = 1.8 GHz 


A/hfs = 9.2 GHz 


For 3p 2 Pi/ 2 , 


For 6p 2 Pi/ 2 , 


A/hfs =0.18 GHz 


A/hfs = 1.2 GHz 



The hyperfine splitting of the ground states and the fine-structure 
splitting of the first excited states are indicated on the plot of energies 
against Z in Fig. 6.5. The values shown are only a guideline; e.g. different 




Fig. 6.5 A logarithmic plot of the 
energy of various structures against 
atomic number Z\ the hyperfine split- 
ting of the ground state is plotted with 
data from Fig. 5.7. All the points are 
close to the maximum values of that 
quantity for low-lying configurations, 
terms, levels and hyperfine levels (as 
appropriate) of neutral atoms with one 
or two valence electrons, and these il- 
lustrate how these quantities vary with 
Z . This is only a rough guideline in 
particular cases; higher-lying configura- 
tions in neutral atoms have smaller val- 
ues and in highly-ionized systems the 
structures have higher energies. 



